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In  recent  years,  optical  second  harmonic  generation 
( SHG )  has  been  shown  to  provide  valuable  new  information 
on  a  number  of  interfacial  properties. 1  Since  SHG  is  electric 
dipole  forbidden  in  the  bulk,  this  technique  can  be  used  to 
probe  selectively  the  interfacial  layer  between  two  centro- 
symmetric  media.  Thus  a  wide  variety  of  surfaces  have  been 
studied  by  this  technique;  a  substantial  amount  of  knowl¬ 
edge  has  been  gathered  about  concentration,  orientation, 
and  spectroscopy  of  the  species  at  the  interfaces. 2‘13  In  our 
earlier  work,  we  focused  our  attention  on  the  relative12  and 
absolute1'  orientation  of  molecules  at  the  surface  of  liquid 
solutions.  We  now  address  the  question  of  how  the  asymme¬ 
try  in  the  forces  experienced  by  the  molecules  in  the  interfa¬ 
cial  region  affects  simple  chemical  equilibria.  Specifically, 
using  this  technique  we  have  investigated  the  acid-base  equi¬ 
librium  between  ^-nitrophenol  ( HA )  and  its  anion  ( A  "  ). 

OH  0<_) 

0  -  H20  ^ 

no2  no2 

As  we  will  show,  not  only  do  the  concentrations  of  the  var¬ 
ious  species  in  the  interfacial  region  differ  from  their  bulk 
values,  but  also  the  relative  concentrations  do  not  corre¬ 
spond  to  the  bulk  equilibrium  constant.  The  "equilibrium 
constant"  in  the  surface  region  is  vastly  different  from  that  in 
the  bulk.  The  key  factor  is  most  likely  the  reduced  polarity  at 
the  surface  due  to  the  low  density  of  molecules  on  the  vapor 
side  of  the  surface.  The  effect  of  this  is  to  decrease  the  stabil¬ 
ity  of  charged  species,  e  g..  A  " ,  to  such  an  extent  that  the 
equilibrium  in  the  interfacial  region  overwhelmingly  favors 
the  neutral  undissociated  HA  molecules.  It  is  to  be  noted 
that  although  the  concentrations  are  dependent  on  their  po- 
sition.  varying  in  a  continuous,  though  very  sharp  way.  from 
the  bulk  vapor  to  the  bulk  liquid,  we  can  still  define  an  aver¬ 
age  concentration  for  each  of  the  species  in  this  inhomogen¬ 
eous  region.  We  then  seek  to  compare  the  relative  average 
concentrations  of  the  species  in  the  interfacial  region,  e  g., 
A  "/HA,  with  the  relative  concentrations  given  by  the  bulk 
equilibrium  constant  In  order  to  treat  the  position-depen¬ 
dent  concentrations  (activities)  in  the interfacial  region,  we 
would  use  the  chemical  potential,  since  its  value  for  a  given 
species  is  the  same  at  al!  positions  in  the  system,  i.e.,  bulk 
liquid,  vapor,  and  the  inhomogenous  interfacial  region.  Dis¬ 
cussions  of  this  approach  will  be  postponed  to  a  later  publi¬ 
cation. 

The  experimental  setup  consisting  of  a  cavity-dumped, 
synchronously  pumped  dye  laser,  and  single  photon  count¬ 
ing  electronics  is  described  elsewhere."  The  component  of 
the  surface  second  order  nonlinear  susceptibility  y;2’  per¬ 


pendicular  to  the  plane  of  incidence  was  measured  by  setting 
the  analyzer  to  collect  only  the  s-polarized  SH  light.  Such  an 
arrangement  is  selected  because  it  contains  the  contribution 
from  the  surface  region  alone  and  not  the  bulk  region.2 

The  observed  r-polarized  SH  field  amplitude  ( E  )  as  a 
function  of  bulk  pH  of  the  solution  is  shown  in  Fig.  1.  We 
found  in  earlier  work13  that  up  to  a  monolayer  the  nonlinear 
susceptibility  j'*2’  can  be  written  as 

£2>  =  A,<S<2>>,  (2) 

where  Ni  is  the  number  of  molecules  per  unit  area  of  the 
surface  anda12’  the  molecular  polarizability.  From  this  rela¬ 
tion  it  follows  that  the  second  harmonic  field  E  f"  is  linear  in 
N, .  As  the  bulk  />H  is  varied,  the  relative  amount  of  HA  and 
A-,  as  given  by  the  bulk  equilibrium  constant  for  reaction 
( 1 )  changes.  This  results  in  a  change  in  N,  as  the  latter  de¬ 
pends  on  bulk  concentrations.  At  low  pH,  the  bulk  concen¬ 
tration  of  HA  has  its  largest  value  and  this  leads  to  a  high 
surface  coverage  of  neutral  HA  molecules.  In  accordance 
with  this  an  intense  signal  ( about  80  times  that  of  water )  was 
observed.  In  extremely  alkaline  media  (/>H  >  10 ) ,  the  bulk 
HA  molecules  are  almost  totally  converted  into  the  anion 
and  we  found  that  the  SH  signal  decreased  to  the  value  ob¬ 
tained  from  pure  water.  If  there  was  a  significant  anion  con¬ 
centration  at  the  surface  we  would  readily  observe  it  since  its 
SH  signal  should  be  more  than  10  times  that  of  water.  The 
possibility  of  undissociated  sodium  phenolate  formed  by- 
neutralization  with  NaOH  being  present  at  the  surface  is 


FIG  1  s-polanzed  surface  second  harmonic  field.  E  [  v  /  i  lu>)  ]  of  an 
aqueous  solulion  ofpnilrophenol  as  a  function  of  pH  al  2J  ’C 
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also  unlikely  since  its  presence  would  have  produced  a  sec¬ 
ond  harmonic  signal  much  above  that  of  water.  We  therefore 
conclude  that  the  anion  once  formed  does  not  stay  at  the 
surface  and  thus  in  alkaline  media  the  interfacial  region  is 
made  up  of  water  molecules  alone.  From  our  second  har¬ 
monic  measurements  we  estimate  that  the  ratio  of  A  "  to  HA 
is  at  least  50-100  times  smaller  in  the  interfacial  region  than 
in  the  bulk  solution.  The  equilibrium  in  the  interfacial  region 
is  thus  clearly  shifted  towards  the  neutral  molecules  and 
differs  markedly  from  its  bulk  value. 

Although  Fig.  1  is  similar  to  a  neutralization  curve  the 
pH (7.9  ±  0.2)  at  which  the  SH  signal  amplitude  reduces  to 
half  its  maximum  value  is  different  from  the  pH(7.15)  at 
which  the  concentration  of  undissociated  HA  in  the  bulk 
decreases  to  half.  This  is  not  surprising  since  the  bulk  pH 
controls  the  concentration  of  HA  in  the  bulk,  while  the  sur¬ 
face  concentration  ,V,  is  not  a  simple  linear  function  of  the 
latter.  This  point  will  be  pursued  in  our  future  work.  It  is 
furthermore  of  interest  to  note  the  sensitivity  of  surface  sec¬ 
ond  harmonic  generation  as  demonstrated  by  these  results. 
Under  the  conditions  described  in  this  work,  when  the  SH 
signal  changes  by  a  factor  of  80.  we  found  that  the  surface 
tension  changed  by  less  than  10%. 
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IR  diode  laser  study  of  vibrational  energy  distribution  in  C02  produced 
by  UV  excimer  laser  photofragmentation  of  pyruvic  acids) 
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Time  domain  absorption  spectroscopy  using  a  tunable,  infrared  diode  laser  has  been  used  to 
monitor  the  vibrational  excitation  of  CO,  produced  in  the  193  nm  excimer  laser  photolysis  of 
gas  phase  pyruvic  acid  (CHjCOCOOH).  Nascent  vibrational  populations  were  measured  in 
the  following  ten  vibrational  states  of  C02:  00°0,  01  '0,  02 20,  0330,  0440,  00°  1,  00°2,  00°3,  01 1 1, 
and  0221.  Approximately  97%  of  the  CO,  photoproduct  is  observed  to  be  directly  formed  in 
the  vibrational  ground  state.  The  remaining  molecules  are  formed  with  a  significant  degree  of 
vibrational  excitation,  having  mode  temperatures  7"(v2)  =  1800  ±  150  K,  T(v 2)  =  3700 
±  1000  K,  T(  v2  +  v3)  =  2000  ±  400  K.  The  present  experimental  data  suggest  that  the  193 
nm  photolysis  may  proceed  through  more  than  a  single  dissociation  channel  and  involve  a 
number  of  different  photofragments. 


INTRODUCTION 

In  recent  years,  the  photodissociation  dynamics  of  small 
molecules  has  become  a  subject  of  great  interest.  Measure¬ 
ments  of  fragment  energy  distributions  and  angular  orienta¬ 
tions  have  yielded  a  wealth  of  state-specific  experimental  in¬ 
formation  about  these  dissociation  processes.  Such 
information  can  provide  insight  into  the  geometry  of  the  dis¬ 
sociating  species  and  the  nature  of  the  potential  energy  sur¬ 
faces  involved.5'5  In  addition,  theoretical  models  involving 
“half-collision"  arguments  and  molecular  dynamics  on  po¬ 
tential  energy  surfaces  have  become  quite  refined.  The  vast 
majority  of  this  experimental  and  theoretical  work  has 
sought  to  provide  a  detailed  interpretation  of  the  dissocia¬ 
tion  dynamics  of  predominantly  diatomic  and  triatomic  spe¬ 
cies,  while  the  photolysis  of  most  larger  polyatomic  mole¬ 
cules  is  less  well  understood. 

Of  particular  importance  in  understanding  the  dynam¬ 
ics  of  photofragmentation  processes  are  experimental  stud¬ 
ies  of  vibrational  energy  disposal  in  the  photoproducts.  Such 
experiments  yield  information  about  mode-specific  excita¬ 
tion  of  photolysis  products,  which  can  in  some  cases  be  ex¬ 
trapolated  back  to  provide  a  better  understanding  of  the  dis¬ 
sociation  mechanism. "■*  A  variety  of  techniques,  including 
infrared  fluorescence,  laser-induced  fluorescence,  and  in¬ 
frared  laser  absorption,  have  been  employed  to  probe  these 
product  state  energy  distributions. 1  A  new  technique,  which 
is  presently  being  developed  in  our  laboratory  to  investigate 
chemical  dynamics  and  molecular  energy  transfer,  utilizes  a 
tunable  diode  laser  absorption  probe.  This  technique  has 
been  employed  to  monitor  the  time  resolved  IR  absorption 
of  C02  molecules  undergoing  collisions  with  fast  hydrogen 
and  deuterium  atoms  as  well  as  with  electronically  excited 
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mercury,  NO,,  and  azulene.9'14  The  high  resolution 
( ~  0.0003  cm  - 1 )  and  the  wide  tunability  of  the  diode  laser 
make  possible  vibrationally  and  rotationally  state-specific 
studies  in  a  wide  variety  of  molecules.  In  this  experiment, 
virtually  any  rovibrational  state  of  CO,  can  be  probed  by 
monitoring  transitions  of  the  type 

CO 2(mn‘p,  J)  -+-  hv(4.3  /am)  —CO .(mn'p  +  \J  ±  1 ) 

(1) 

which  share  the  large  absorption  coefficient  of  the  v3  funda¬ 
mental.15  The  letters  m,  n,  p,  and  J  are  (respectively)  the 
quantum  numbers  for  the  symmetric  stretching  ( v, ),  bend¬ 
ing  (v2),  and  antisymmetric  stretching  (v3)  vibrations,  and 
the  rotational  motion  of  the  molecule 

In  the  present  experiments,  the  UV  photolysis  of  pyru¬ 
vic  acid  was  investigated  by  monitoring  the  IR  absorption  of 
its  C02  dissociation  product.  Previous  photodissociation 
studies  in  both  the  gas  and  liquid  phases  have  shown  that 
pyruvic  acid  undergoes  both  thermal  and  photochemical  de¬ 
composition  to  yield  predominantly  acetaldehyde  and 
CO,16~:i: 

CHjCOCOOH  +  hv(  193  nm )  -CH.CHO  +  CO Amnp). 

(2) 

In  one  experiment  which  examined  the  vapor  phase  photoly¬ 
sis  of  pyruvic  acid  at  366  nm,  small  amounts  ( l%-2% )  of 
methane  and  carbon  monoxide  were  measured  among  the 
photolysis  products.10  Several  mechanisms  for  the  dissocia¬ 
tion  of  this  molecule  have  been  proposed.  One  of  these  in¬ 
volves  a  four-center  mechanism  which  leads  to  the  direct 
production  of  acetaldehyde  and  CO;.  while  another  involves 
a  five-center  process  which  yields  an  hydroxycarbene  inter¬ 
mediate  species.  A  recent  study  which  measured  the  4.3  /<m 
CO,  IR  fluorescence  as  a  function  of  dissociation  wave¬ 
length  supports  the  five-center  mechanism  involving  the  hy¬ 
droxycarbene  intermediate.20  In  another  experiment,  mea¬ 
surements  of  the  infrared  multiphoton  decomposition  of 
pyruvic  acid  suggest  that  the  dissociation  process  occurs  by 
ummolecular  decomposition  in  high  vibrational  levels  of  the 
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ground  electronic  state.22  The  UV  photolysis  of  pyruvic  acid 
is  also  thought  to  proceed  through  an  internal  conversion 
step  wnere  highly  vibrationally  excited  ground  state  mole¬ 
cules  decompose  to  form  acetaldehyde  and  C02.20 

In  a  previous  study,  we  measured  the  re1.-  ive  produc¬ 
tion  of  C02  bending  and  antisymmetric  stretching  quanta 
formed  in  the  193  nm  photolysis  of  pyruvic  acid  by  probing 
the  01 ‘0— 01 ‘1  and  00°  1—  00°2  transitions  in  CO:.21  The 
present  experiments  extend  this  work  by  measuring  the  com¬ 
plete  distribution  of  vibrational  energy  deposited  in  the  CO, 
bending,  stretching,  and  combination  state  manifolds.  We 
have  examined  such  overtone  levels  as  0220, 03  50, 04*0, 00°2, 
00°3,  01  *1,  and  0221,  as  well  as  the  00°0  ground  state.  In 
addition,  we  present  preliminary  information  concerning 
the  degree  of  rotational  excitation  in  the  C02  photofrag¬ 
ment. 

EXPERIMENTAL 

The  experimental  apparatus  employed  in  this  study  is 
similar  to  that  used  in  several  previous  IR  diode-UV  ex- 
cimer  laser  double  resonance  experiments  carried  out  in  our 
laboratory.'1"11'14  Most  of  the  experimental  details  have  been 
discussed  previously,  and  only  new  features  will  be  described 
here.  The  photolysis  mixture,  consisting  of  either  neat  pyru¬ 
vic  acid  or  a  mixture  of  pyruvic  acid  and  argon,  flows 
through  a  cylindrical  gas  sample  cell  207  cm  long  (see  Fig. 
1 ) .  The  4.3  n m  cw  output  of  a  tunable  IR  diode  laser  ( Laser 
Analytics)  and  the  193  nm  pulsed  output  of  a  UV  excimer 
laser  (Lambda  Physik)  are  collinearly  propagated  through 
the  cell  by  the  use  of  an  IR  transparent  MgF,  beam  splitter 
coated  for  high  reflectivity  at  193  nm.  The  excimer  laser  is 
operated  in  an  unstable  resonator  configuration  which  mini¬ 
mizes  the  divergence  of  the  UV  beam  in  the  far  field  and 
allows  for  the  propagation  of  the  photolysis  beam  over  the 
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FIG  1  A  schematic  diagram  of  the  excimer/diode  laser  double  resonance 
apparatus.  The  LTV  photolysis  source  is  a  pulsed  excimer  laser  operating  at 
193  nm,  and  the  IR  probe  source  is  a  tunable  dtode  laser  emitting  at  approxi¬ 
mately  4  3  Jim.  A  portion  (  —  8% )  of  the  diode  laser  probe  beam  is  diverted 
through  a  high  voltage  dc  discharge  cell  used  to  locate  the  high  vibrational 
states  of  CO,  which  are  excited  through  an  "anharmonic  pumping1’  pro¬ 
cess.  The  discharge  cell  is  also  used  as  a  reference  cell  in  conjunction  with  a 
lock-in  amplifier  to  frequency  stabilize  the  diode  laser.  The  IR  probe  beam 
is  mode  selected  with  a  monochromator  and  detected  with  a  liquid  nitrogen 
cooled  IR  detector. 


long  distances  (5-7  m)  necessary  for  the  present  experi¬ 
ments.  Time-dependent  changes  in  the  absorption  of  the  di¬ 
ode  laser  beam  caused  by  C02  produced  in  the  photolysis 
process  are  monitored  with  a  liquid  nitrogen  cooled  InSb 
detector  and  matched  amplifier  ( 300  ns  response  time ) .  The 
signals  are  acquired  with  a  Biomation  8 100  transient  record¬ 
er  and  signal  averaged  with  a  Nicolet  1 170  averager. 

Using  a  207  cm  cell  at  a  CO,  pressure  of  10  Torr,  where 
pressure  broadening  begins  to  make  the  direct  spectral  ob¬ 
servation  of  weak  absorption  features  difficult,  the  present 
apparatus  is  able  to  detect  the  presence  of  molecules  in  states 
whose  ambient  population  at  room  temperature  is  about 
10  "s  of  the  ground  state  population.  In  order  to  locate  and 
identify  transitions  corresponding  to  high  lying  vibrational 
levels  which  have  extremely  small  ambient  populations  at 
room  temperature,  a  portion  (  —  8% )  of  the  IR  beam  is  split 
off  and  directed  through  a  CO,  discharge  reference  cell.21 
This  cell  consists  of  a  high  voltage  dc  discharge  applied  to  a 
low  pressure  mixture  of  CO;,  N,,  and  He.  It  provides  a 
steady  state,  non-Boltzman  population  of  highly  vibration- 
ally  excited  C02  with  effective  vibrational  mode  tempera¬ 
tures  as  high  as  2900  K.24  We  present  in  Fig.  2  typical  ab¬ 
sorption  spectra  observed  through  the  auxiliary  cell  with  the 
discharge  both  on  and  off-.  Note  that  transitions  from  high 
lying  vibrational  levels  such  as  00°3  —  00H4  are  readily  ob¬ 
served  when  the  discharge  is  applied.  This  is  remarkable  be¬ 
cause  there  is  less  than  1  molecule/cmJ  in  a  given  rotational 
level  of  the  00°3  vibrational  state  for  a  1  Torr  sample  at  300 
K.  Using  the  discharge  cell,  we  have  been  able  to  probe  CO, 
vibrational  levels  as  high  as  0550  and  OO^.  The  reference  cell 
is  also  used  to  actively  stabilize  the  diode  laser  frequency 
onto  a  specific  absorption  line  during  the  experiment.  The 
reference  signal  is  sent  to  a  lock-in  amplifier  whose  deriva¬ 
tive  output  is  fed  back  to  the  diode  laser  current  controller. 
This  procedure  “locks”  the  laser  frequency  to  the  peak  of 
any  given  CO,  absorption  line  appearing  in  the  discharge 
cell. 
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FIG.  2  Typical  CO:  absorption  spectra  using  the  CO:  discharge  reference 
ceil  Top  trace:  Fringe  pattern  of  a  solid  Ge  etalon  used  for  frequency  refer¬ 
encing  (  free  spectral  range  -0.0475  cm  1 )  Middle  trace  CO.  absorption 
spectrum  m  ihe  vicinity  of  2283  cm  1  observed  through  the  reference  cell 
with  the  discharge  applied  (N:  CO.  He  —  4  J  7;  P,M  =  10  Ton:  discharge 
current  =  25  r  \  )  Note  the  intensity  of  the  absorption  feature  correspond¬ 
ing  to  the  transition  originating  on  the  high  lying  (XT' 3  vibrational  state 
Lower  trace:  CO:  absorption  spectrum  in  the  same  spectral  region  with  the 
discharge  turned  off. 
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In  order  to  account  for  the  shot  to  shot  instabilities  of 
our  excimer  laser  photolysis  source,  the  average  pulse  inten¬ 
sity  is  measured  by  diverting  a  portion  of  the  UV  beam  to  a 
Gentec  power  meter  and  accumulating  the  signals  with  a 
signal  averager.  The  data  is  subsequently  normalized  by  this 
factor. 

The  pyruvic  acid  used  in  these  experiments  (Aldrich 
98%)  was  purified  by  repeated  fractional  distillation  under 
partial  vacuum  conditions.  Sample  purity  was  checked  with 
NMR  spectroscopy.  Pyruvic  acid/argon  samples  were  ob¬ 
tained  by  combining  a  concentrated  mixture  of  pyruvic  acid 
vapor  in  argon  ( made  by  flowing  argon  across  a  reservoir  of 
the  liquid  acid)  with  a  stream  of  pure  argon.  By  varying  the 
relative  flow  rates  of  the  two  streams,  a  variety  of  sample 
concentrations  could  be  achieved  while  maintaining  a  total 
pressure  of  5  Torr  in  the  cell.  The  partial  pressure  of  pyruvic 
acid  in  each  mixture  was  determined  by  comparing  the  tran¬ 
sient  absorption  signal  of  the  mixture  with  signals  obtained 
using  known  pressures  of  pure  pyruvic  acid  in  the  cell.  The 
sample  flow  speed  was  sufficiently  rapid  to  prevent  the 
buildup  of  photoproduct  contaminants  in  the  cell.  It  is  par¬ 
ticularly  important  to  avoid  the  accumulation  of  vibrational- 
ly  relaxed  CO;  molecules  whose  significant  room  tempera¬ 
ture  populations  in  the  00°0  and  01 ‘0  levels  can  distort 
measurements  of  the  nascent  populations  in  these  levels. 

In  a  typical  experiment,  the  change  in  the  diode  laser 
absorption  (mn!pj)  —  (mn'p  +  1,7+1)  is  measured  fol¬ 
lowing  the  UV  photolysis  of  pyruvic  acid/argon  mixtures. 
Of  particular  interest  is  the  prompt  rise  in  signal  intensity 
which  immediately  follows  the  excimer  laser  pulse  (20  ns 
_ I 


width).  The  amplitude  of  this  detector  limited  rise  is  a  mea¬ 
sure  of  the  number  of  C02  molecules  found  in  the  (mn'pj) 
level  after  a  time  interval  of  300  ns,  the  response  time  of  the 
detection  system.  The  sample  pressure  and  composition  are 
chosen  specifically  to  influence  the  dynamics  occurring 
within  this  time  period.  For  example,  the  partial  pressure  of 
pyruvic  acid  is  kept  at  40-50  mTorr  in  order  to  assure  ade¬ 
quate  signal  intensity  while  minimizing  the  partial  pressure 
of  the  C02  photoproduct.  This  ensures  that  vibrational  re¬ 
laxation  within  each  C02  vibrational  mode  (which  occurs 
within  2-25  CO--CO-  collisions)  is  insignificant  within  the 
rise  time  of  the  detector.  In  addition,  excess  argon  ( typically 
5  Torr)  is  added  to  the  pyruvic  acid  in  order  to  ensure  com¬ 
plete  rotational  relaxation  in  each  vibrational  level  of  the 
nascent  C02  molecules  within  the  rise  time  of  the  signal.  As 
a  result,  the  population  within  a  particular  vibrational  level 
may,  in  principle,  be  obtained  by  measuring  only  one  absorp¬ 
tion  line  out  of  that  level  since  the  relative  intensities  of  the 
absorption  lines  are  given  by  the  rotational  Boltzman  distri¬ 
bution.  However,  the  vibrational  populations  reported  in  the 
next  section  are  extracted  from  measurements  taken  on  two 
or  more  absorption  lines  per  vibrational  state,  each  of  which 
was  found  to  yield  similar  results  for  the  population  of  the 
entire  vibrational  level. 

RESULTS 

Immediately  following  the  193  nm  photolysis  of  pyruvic 
acid,  transient  IR  absorption  signals  were  observed  while 
monitoring  the  following  CO:  transitions: 


00°0-00°1  mo): 

01  ’0  —  01 1 1  />(49),/5f35),/>(30); 
02:0  — 02:1  P(}7).P[22).P(  17): 
0330  —  03*1  /’(24),/>(8),F(33): 
04J0  —  04J 1  P(  10)./>(20): 
10"0-10'”i  P( 48); 


00"  1  —  00°2  P(  1 3).P(  17).P(  37); 
00"2— 00°3  P(  10),/? ( 14),R(20); 
00°3  —  00°4  R(23),R(25); 

01 1 1  —01  '2  P(8).P(24); 

02: 1  — 02:2  P(l 0); 

00o0-00ol  R(40),R(48)  ,5CO;. 


Our  investigation  focuses  on  the  bending  ( v:)  and  asymme¬ 
tric  stretch  ( v, )  vibrational  modes  of  CO; .  These  modes  are 
only  weakly  coupled  to  each  other,  requiring  approximately 
30  000  CO;-CO:  collisions  for  intermode  equilibration.25  26 
Rovibrational  levels  in  the  bending  and  symmetric  stretch 
( v, )  modes,  on  the  other  hand,  are  strongly  coupled  via  Fer¬ 
mi  resonances  and  easily  reach  vibrational  equilibration 
within  the  rise  time  of  the  detector.  As  a  result,  these  experi¬ 
ments  do  not  differentiate  between  the  nascent  populations 
formed  in  such  states  as  !0°0,  02“0,  and  0220.  Measurements 
of  the  vibrational  populations  for  the  10°0  and  0220  states 
yield  a  ratio  of  N]n„o/N02;0  -  0.40  ±  0.10.  which  is  in  excel¬ 
lent  agreement  with  the  value  of  0.39  predicted  by  Boltz¬ 
mann  statistics  at  room  temperature.  This  indicates  that  the 
Fermi-mixed  v,  and  v;  vibrational  states  have  attained  ther¬ 
mal  equilibrium  through  collisions  with  argon  within  the 
response  time  of  the  detector.  Nevertheless,  intramode  vi- 


I - ' 

brational  equilibration  (i.e  .  between  01 '0.  02;0.  03 ‘0.  etc.) 
does  not  occur  on  this  time  scale. 

The  absorption  signals  obtained  by  our  apparatus  are  a 
measure  of  the  difference  in  population  between  the  upper 
and  lower  levels  of  the  IR  transition  In  order  to  determine 
the  nascent  population  in  a  particular  vibrational  level 
(mn'p),  it  was  necessary  to  take  into  account  the  nascent 
population  in  the  upper  level  (mn  p  —  1 )  .  In  the  present  ex¬ 
periments,  contnbutions  by  the  upper  state  populations  to 
the  signals  from  low  lying  vibrational  levels  such  as  00' 0. 
01  '0, 02;0, 00“  1,  and  00°2  were  determined  directly  by  prob¬ 
ing  ( respectively)  the 00°  1. 01 M, 02;1. 00"2.  and 00' 3  states. 
For  higher  vibrational  levels,  it  was  necessary  to  estimate  the 
contribution  of  the  upper  state  by  extrapolating  from  the 
measured  data.  Since  the  nascent  populations  were  found  to 
fall  off  rapidly  with  increasing  energy,  the  upper  state  contri¬ 
butions  were  usually  negligible 
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FIG.  4.  Transient  absorption  signals  obtained  while  monitonng  the 
02^— 02:1  PI  22 )  transition  in  CO,  following  the  193  nm  photolysis  of  pyr¬ 
uvic  acid.  These  curves  illustrate  the  effect  of  added  Ar  on  the  rotational 
equilibration  process  which  is  responsible  for  feeding  population  into  this 
state.  As  the  pressure  of  Ar  is  increased,  the  initial  change  in  absorption  of 
the  diode  laser  becomes  more  rapid  because  the  rate  of  rotational  equilibra¬ 
tion  is  increased. 


FIG  3  Transient  absorption  signals  obtained  while  monitonng  the  lowest 
bending  and  asymmetnc  stretching  levels  of  the  CO,  photofragment  The 
prompt  nse  in  the  signal  corresponds  to  the  direct  formation  of  CO.  mole¬ 
cules  in  the  rovibrational  level  being  probed  by  the  diode  laser.  The  initial 
population  of  CO.  found  in  the  00“1  level  is  substantially  smaller  than  that 
found  in  Ol'O.  but  the  long  time  00"  1  population  increases  dramatically  due 
to  colltsional  excitation 


Bending  excitation  in  the  C02  photoproduct 

Transient  absorption  signals  were  obtained  while  moni¬ 
toring  the  Ol'O.  02:0,  03’0.  and  04J0  bending  mode  vibra¬ 
tions  in  the  CO,  fragment.  The  01  '0  signal,  shown  in  Fig.  3, 
exhibits  a  detector  limited  rise  (immediately  following  the 
excimer  laser  pulse)  corresponding  to  the  C02  molecules 
which  are  formed  directly  or  relax  into  the  01  '0/  =  30  state. 
Following  the  prompt  rise,  we  observe  more  complex  tempo¬ 
ral  behavior  which  reflects  the  subsequent  relaxation  during 
collisions  with  other  species  present  in  the  sample  cell.  Qual¬ 
itatively  similar  temporal  profiles  were  observed  in  all  of  the 
bending  level  signals. 

Evidence  of  rotational  excitation  within  the  02‘0  vibra¬ 
tional  state  is  seen  in  the  02:0  P{  22 )  and  P(  37 )  signals  ( not 
shown )  for  pure  pyruvic  acid  at  pressures  below  0. 1  Torr.  At 
pressures  as  low  as  40  mTon\  the  P(  37)  transition  displays 
an  initial  nse  which  is  detector  limited.  The  P(  22)  absorp¬ 
tion  signal,  on  the  other  hand,  is  not  detector  limited  and 
rises  at  a  rate  which  is  dependent  on  the  total  gas  pressure. 
This  suggests  that  CO,  molecules  which  are  produced  in  the 
0220  pure  bending  state  are  also  formed  with  significant  rota¬ 
tional  excitation.  The  lower  rotational  levels  are  filled  during 
subsequent  rotational  relaxation.  For  states  such  as  0220, 
vibrational  energy  transfer  from  levels  10°0,  02°0  can  also 
occur  as  a  result  of  collisions  as  noted  above.  The  effect  of 


increased  argon  pressure  on  the  initial  rise  time  of  the  02:0 
P(  22 )  absorption  signal  is  shown  in  Fig.  4.  The  addition  of  5 
Torr  of  argon  was  found  to  ensure  complete  rotational  relax¬ 
ation  within  the  rise  time  of  the  detector.  Subsequent  mea¬ 
surements  of  other  CO,  vibrational  levels  were  carried  out 
under  these  conditions. 

In  Fig.  5  we  present  the  relative  nascent  vibrational  pop¬ 
ulations  measured  for  the  01  '0.  02:0.  0330.  and  04J0  bending 
states  of  CO,.  The  logarithm  of  the  vibrational  populations  is 
seen  to  decrease  linearly  with  the  vibrational  energy,  sug¬ 
gesting  a  mode  "temperature”  for  the  bending  levels  which  is 
significantly  hotter  than  300  K.  By  fitting  a  line  to  the  levels 
with  bending  excitation,  we  find  that  the  effective  mode  tem¬ 
perature  is  almost  1800  K.  While  mode  temperatures  nor¬ 
mally  include  the  ground  state,  our  results  (below)  suggest 
that  the  vibrationally  excited  CO,  photoproduct  appears  to 
be  the  result  of  a  dissociation  process  which  is  different  from 
that  which  forms  ground  state  CO,  molecules.  As  a  result, 
the  two  populations  must  be  treated  separately. 

Antisymmetric  stretching  excitation  in  the  C02 
photoproduct 

Transient  absorption  signals  were  obtained  while  moni¬ 
toring  the  00°  1,  00°2,  and  00°3  antisymmetric  stretching 
states  of  CO,  following  the  photodissociation  pulse.  The 
00°1  signal,  shown  in  Fig.  3,  displays  a  small,  detector  limit¬ 
ed  nse  followed  by  a  much  slower,  large  amplitude  increase 
The  fast  nse  is  caused  by  CO,  molecules  which  are  directly- 
produced  with  antisymmetric  stretching  vibration  as  a  result 
of  the  photolysis  process.  The  slower,  large  amplitude  nse  is 
due  to  energy  transfer  into  the  asymmetnc  stretch  mode  as  a 
result  of  collisions  with  hot  photofragments.  The  addition  of 
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FIG  5.  Nascent  distribution  of  population  among  the  various  vibrational 
states  of  the  CO;  photoproduct  immediately  following  the  photolysis  pulse 
Note  the  large  abundance  of  molecules  which  are  directly  formed  in  the 
ground  state  as  well  as  the  hot  mode  temperatures  which  are  suggested  by 
the  distribution  of  molecules  among  the  vibrationally  excited  states  of  C0;. 
The  v,  and  v:  modes  in  CO;  are  tightly  coupled  through  the  Fermi  reso¬ 
nance  pairs  present  in  the  molecule  Thus,  with  the  large  excess  of  Ar  pres¬ 
ent  in  these  experiments,  it  is  not  possible  to  obtain  unambiguous  results  for 
the  nascent  populations  in  such  levels  as  10°0,  02°0.  and  02‘0  which  are 
rapidly  equilibrated  by  CO;-Ar  collisions  during  the  rise  time  of  the  detec¬ 
tion  system. 


argon  to  pure  pyruvic  acid  caused  no  change  in  the  ampli¬ 
tudes  or  rates  of  these  features,  suggesting  that  rotational 
relaxation  does  not  play  a  significant  role  in  these  signals. 
The  nascent  vibrational  population  of  each  of  the  antisym¬ 
metric  stretching  states  is  shown  in  Fig.  5.  Note  that  the 
absolute  number  of  molecules  directly  produced  in  these  lev¬ 
els  is  much  smaller  than  that  found  in  the  vibrational  states 
of  the  bending  manifold,  but  the  population  distribution 
within  the  stretching  mode  is  substantially  hotter  than  the 
bending  states,  having  an  effective  mode  temperature  of 
3700  K. 

Bend/stretch  excitation  in  the  C02  photoproduct 

Excitation  in  bend/stretch  combination  states  of  the 
C02  photoproduct  was  observed  by  measuring  the  transient 
absorption  signals  for  the  01 1 1  —01 '2  and  0221  —  022  2  tran¬ 
sitions.  These  signals  demonstrate  temporal  behavior  which 
is  qualitatively  similar  to  that  shown  by  the  pure  antisymme¬ 
tric  stretching  levels  (see  Fig.  3 ),  showing  an  initial  prompt 
rise  followed  by  a  slower,  large  amplitude  increase  due  to 
vibrational  energy  transfer  from  hot  photofragments.  The 
nascent  populations  for  the  01 '1  and  0221  level,  shown  in 
Fig.  5,  have  an  effective  vibrational  temperature  of  approxi¬ 
mately  2000  K.  This  value  lies  between  the  mode  tempera¬ 
tures  of  the  pure  bending  levels  and  the  pure  asymmetric 
stretch  states,  just  as  the  combination  state  vibrational  popu¬ 
lations  lie  in  between  the  pure  bend  and  asymmetric  stretch 
populations. 


Formation  of  ground  state  COx  (00°0)  photoproduct 

The  production  of  C02  in  the  vibrational  ground  state 
was  measured  by  probing  the  00°0—  00°1  transition.  Since 
the  amount  of  C02  present  in  the  atmosphere  is  sufficient  to 
cause  complete  absorption  of  the  diode  laser  beam  at  fre¬ 
quencies  near  the  peak  of  the  room  temperature  rotational 
distribution,  we  found  it  necessary  to  monitor  the 
00°0— 00°  1  transitions  of  IJC02  during  the  photolysis.  The 
small  ( 1  % )  natural  abundance  of  this  species  limits  atmo¬ 
spheric  absorption  of  the  diode  laser  and  allows  examination 
of  rotational  levels  close  to  the  peak  of  the  room  temperature 
Boltzmann  distribution.  We  assume  that  the  dynamics  of  the 
dissociation  process  involving  l5CO;  are  the  same  as  those 
involving  the  lighter  l2CO:  species.  In  theory,  the  vibration¬ 
al  population  can  also  be  measured  by  monitoring  a  very 
high  rotational  line  in  the  ground  state  of  l2CO:.  However, 
this  latter  method  involves  the  extraction  of  an  entire  vibra¬ 
tional  population  from  a  weak  absorption  line  located  on  the 
wings  of  the  rotational  distribution,  and  as  a  result,  involves 
a  great  deal  of  experimental  uncertainty. 

The  l3CO;  00u0  —  00°  1  absorption  signals  exhibit  an  ini¬ 
tial  prompt  rise  which  has  a  very  large  amplitude.  Relative 
population  measurements  show  that  the  number  of  mole¬ 
cules  directly  formed  in  this  state  during  pyruvic  acid  photo¬ 
lysis  is  30  +  6  times  that  produced  in  the  01  '0  bending  state 
(see  Fig.  5).  In  other  words,  95%-98%  of  all  CO;  photo¬ 
fragments  are  formed  in  the  ground  vibrational  level.  This 
result  is  in  excellent  agreement  with  our  previous  study 
which  showed  (by  absolute  population  measurements  rath¬ 
er  than  the  relative  measurements  presented  here)  that  only 
3%  of  the  nascent  C02  molecules  are  found  in  the  01  '0  lev¬ 
el.21  In  addition,  an  initial  prompt  rise  (which  was  subse¬ 
quently  relaxed  by  the  presence  of  excess  argon )  was  seen  in 
rotational  levels  as  high  as  l2CO,  00°0y  =  60.  This  suggests 
a  high  degree  of  rotational  excitation  in  CO,  molecules 
which  are  produced  in  the  ground  state  during  the  photoly- 


DISCUSSION 

There  are  several  interesting  features  of  the  pyruvic  acid 
photodissociation  process  which  can  be  compared  to  the 
data  obtained  in  the  present  experimental  study.  Figure  b 
shows  an  energy  diagram  for  the  photofragmentation  event 
Overall,  the  production  of  carbon  dioxide  and  acetaldehyde 
from  pyruvic  acid  is  exothermic  by  5  kcal/mol.  This  energy 
(  —  1750  cm-1)  is  quite  small  compared  to  that  available 
from  193  nm  photolysis  (152  kcal/mol  or  53  150  cm-1) 
because  of  the  high  energy  of  the  UV  photon.  Even  if  the 
products  initially  produced  by  photodissociation  are  the  hy- 
droxycarbene  intermediate  and  carbon  dioxide,  there  is  still 
a  huge  amount  of  energy  left  over  ( 20  650  cm  "  1 )  to  be  divid¬ 
ed  among  the  fragments.  The  present  data,  which  show  that 
more  than  95%  of  the  CO,  molecules  are  produced  in  the 
vibrational  ground  state,  leave  this  rather  large  energy  unac¬ 
counted  for. 

Simple  geometric  considerations  suggest  that  the  CO. 
molecules  produced  in  the  photofragmentation  process 
should  have  large  amounts  of  bending  energy.  The  CO,  cen- 
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FIG  6.  An  energy  level  diagram  indicating  the  energetics  of  the  193  nm 
photolysis  of  pyruvic  acid.  The  final  CO:  and  acetaldehyde  products  are 
exothermic  by  5  keal/mol  The  formation  of  a  transient  hydroxycarbene 
intermediate  species  along  the  dissociation  pathway  requires  approximately 
54  keal/mol  of  the  total  147  keal/mol  which  is  available  tn  the  photolysis 
process 


ter  in  the  acid  is  sp:  hybridized  with  a  120  deg  angle  for  the 
O-C-O  bond,  while  free  C02  has  linear,  sp  hybridization. 
Assuming  that  the  H  atom  on  the  acid  center  is  transferred 
to  the  CH,CO  fragment  without  significant  rearrangement 
during  photodissociation,  a  simple  Franck-Condon  argu¬ 
ment  would  predict  a  large  amount  of  bending  excitation  in 
the  final  CO,  product  due  to  the  changing  geometry  alone. 
Clearly,  some  molecules  are  produced  with  substantial 
bending  excitation  since  ~3%  of  the  CO,  is  found  in  the 
01  '0,  02  ;0,  03  J0,  0440  states  with  a  distribution  correspond¬ 
ing  to  a  rather  high  temperature  (1800  K).  Nevertheless, 
this  represents  an  extremely  small  fraction  of  the  total  ener- 
gy  held  by  the  photodissociated  species.  Antisymmetric 
stretching  excitation  might  also  be  expected  as  the  C-O  bond 
lengths  change  during  the  evolution  from  pyruvic  acid  to 
free  CO,,  but  the  amount  of  such  excitation  is  observed  to  be 
an  order  of  magnitude  less  than  that  found  in  the  bending 
mode.  Of  course,  considerable  rearrangement  of  the  pyruvic 
acid  would  be  expected  to  occur  through  a  five  center  transi¬ 
tion  state  in  which  the  acid  H  atom  is  shared  by  two  O  atoms 
which  are  in  turn  attached  to  two  different  carbon  atoms.  In 
such  a  structure  the  simple  geometric  arguments  presented 
above  would  no  longer  be  expected  to  hold  because  the  disso¬ 
ciating  species  would  not  necessarily  have  the  ground  state 
geometry. 

The  majority  of  the  missing  energy  is  unlikely  to  have 
been  carried  away  by  a  vibrationally  hot  acetaldehyde  frag¬ 
ment.  While  acetaldehyde  does  have  15  vibrational  modes 
compared  to  4  for  C02,  it  is  extremely  difficult  to  see  how 
such  a  vibrationally  hot  species  {20  650-53  150  cm-1) 
could  separate  from  C02  without  leaving  the  latter  molecule 


much  hotter  than  is  observed,  especially  considering  that  a 
substantial  number  of  the  acetaldehyde  modes  have  higher 
vibrational  frequencies  than  those  of  CO,. 

There  appear  to  be  a  variety  of  possibilities  which  can 
account  for  the  “missing  energy.”  First,  there  is  more  than 
enough  energy  available  from  the  very  energetic  193  nm  UV 
photon  to  break  several  bonds  in  pyruvic  acid.  Thus,  photo¬ 
dissociation  may  actually  produce  more  than  two  fragments 
(e.g.,  CO,,  CO,  CH4,  etc.),  and  some  of  the  missing  energy 
could  simply  be  going  into  molecular  bond  breaking.  Small 
amounts  of  CO  and  CH4  have  been  observed  dunng  the  366 
nm  of  pyruvic  acid,  but  a  product  analysis  study  of  this  pho¬ 
toreaction  has  not  been  made  at  193  nm.1’  We  are  presently 
investigating  this  point  by  using  diode  lasers  to  probe  for  the 
presence  of  CO  and  CH4  following  193  nm  photodissocia- 
tion  of  pyruvic  acid.  Preliminary  results  indicate  that  a  mea¬ 
surable  amount  of  CO  is  formed  immediately  upon  photo¬ 
dissociation. 

Second,  the  missing  energy  could  be  accounted  for  if 
electronically  excited  states  of  the  products  are  formed.  The 
most  obvious  of  these  is  the  hydroxycarbene  intermediate/ 
isomer  which  is  20  650  cm  " 1  above  the  ground  state  of  nor¬ 
mal  acetaldehyde.  The  hydroxycarbene  intermediate  is  espe¬ 
cially  appealing  since  pyruvic  acid  is  expected  to  be  an  inter¬ 
nally  hydrogen-bonded  monomer  in  the  gas  phased7 
Nevertheless,  if  this  is  the  only  excited  species  formed,  large 
amounts  of  energy  are  still  available  for  vibrational  excita¬ 
tion  in  the  CO,  product.  A  higher  electronic  state  of  acetal¬ 
dehyde,  the  hydroxycarbene,  or  even  CO,  could,  of  course, 
account  for  a  large  amount  of  energy.  In  this  regard,  it  is 
interesting  to  note  that  strong  excitation  of  the  CO,  v,  mode, 
which  might  result  from  the  relaxation  of  such  a  highly  ex- 
cited  species,  was  observed  on  a  long  time  scale  in  the  present 
experiments.  In  addition,  absorption  signals  for  the 
01  ‘0—01 1 1  transition  in  CO,  show  a  similar  long  time  filling 
of  the  fundamental  bending  state.  While  a  full  characteriza¬ 
tion  of  the  long  time  behavior  of  all  vibrationally  excited 
states  is  needed,  the  present  observations  suggest  that  both 
bending  and  stretching  levels  of  the  product  CO,  molecule 
are  populated  to  a  similar  extent  on  a  long  time  scale,  prob¬ 
ably  due  to  collisions  with  hoi  photofragments  present  in  the 
cell  after  the  dissociation  process. 

Finally,  nascent  vibrational  populations  have  been  mea¬ 
sured  in  only  ten  low  lying  vibrational  levels  of  CO,.  CO, 
could  be  produced  in  vibrational  levels  higher  than  those 
probed,  i.e.,  the  observed  decrease  m  vibrational  population 
with  increasing  energy  may  be  part  of  a  bimodal  vibrational 
energy  distribution.  Mitigating  3gainst  such  a  picture  are 
"harvesting"  experiments,  where  pyruvic  acid  was  photodis¬ 
sociated  in  the  presence  of  a  large  background  of  cold  CO, , 
which  did  not  reveal  the  presence  of  vibrational  energy  held 
in  higher  vibrational  levels.'''  Such  experiments  are  not  fool¬ 
proof.  however,  and  the  possibility  of  a  bimodal  distribution 
cannot  be  entirely  ruled  out  at  the  present  time. 

The  possibility  that  a  large  amount  of  the  initially  depos¬ 
ited  photon  energy  appears  tn  the  translational  degrees  of 
freedom  of  the  recoiling  fragments  was  considered  by  taking 
advantage  of  the  high  resohitwu  of  the  diode  laser  and  mea¬ 
suring  the  product  C02  absorption  Hnewidth  immediately 
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following  the  photolysis  event.  Such  measurements  were 
performed  for  various  02*0 — 022 1  P,  Q,  and  R  branch  transi¬ 
tions  at  a  pressure  of  25  mTorr  (pure  pyruvic  acid)  where 
the  time  interval  between  collisions  is  ~4  p, s.  Under  these 
conditions,  no  linewidth  changes  were  observed  in  the  time 
interval  approximately  1  ps  after  the  excimer  laser  pulse. 
This  observation  suggests  that  little  translational  energy  ap¬ 
pears  among  product  C02  molecules  which  are  vibrational- 
ly  excited  during  the  photolysis  process  in  the  02 20  state.  The 
possibility  that  the  large  amount  of  ground  state  C02  photo¬ 
fragment  is  produced  translationally  hot  is  presently  under 
investigation  by  measuring  the  linewidth  of  the  00°0  —  00°1 
absorption  transition  after  the  photolysis  process. 

The  presence  of  a  large  amount  of  energy  in  the  transla¬ 
tional  degrees  of  freedom  of  the  product  molecules  could 
also  manifest  itself  in  the  rotational  equilibration  process 
which  occcurs  in  these  experiments  since  translations  and 
rotations  equilibrate  rapidly.  The  importance  of  such  an  ef¬ 
fect,  however,  is  expected  to  be  small  since  under  typical 
experimental  conditions  the  temperature  rise  in  the  cell  is 
5  10".  Such  a  small  increase  in  temperature  would  not  cause 
a  large  enough  shift  in  the  rotational  distribution  to  account 
for  the  transient  population  changes  which  are  observed  in 
the  present  experiments. 

Overall,  the  data  taken  in  the  present  experiments  may 
well  indicate  a  dual  channel  mechanism  for  photodissocia¬ 
tion.  One  channel  appears  to  produce  a  small  amount  of  CO, 
with  a  high  degree  of  vibrational  excitation,  and  could  repre¬ 
sent  the  conventional  Franck-Condon  picture  in  which  the 
CO,  center  is  changing  from  highly  bent,  sp1  character  to 
linear  sp  character.  The  second  channel,  which  produces  pri¬ 
marily  ground  state  CO,,  must  necessarily  proceed  through 
high  energy  transient  species  or  multiple  bond  breaking. 
Given  the  high  energy  of  the  UV  photon  used  in  the  present 
experiments,  several  surface  crossings  to  products  could  be 
available  during  photofragmentation.  Experiments  per¬ 
formed  at  a  variety  of  photolysis  wavelengths  would  be  very 
useful  in  helping  to  sort  out  the  photodissociation  mecha¬ 
nism. 

Finally,  the  present  experiments  also  indicate  that  C02 
is  produced  with  considerable  rotational  excitation.  Mea¬ 
surements  of  rotational  energy  distributions  in  the  00°0, 
01  '0.  and  0220  vibrational  levels  are  currently  in  progress. 

CONCLUSIONS 

A  tunable  IR  diode  laser  absorption  probe  technique 
has  been  employed  to  examine  nascent  vibrational  popula¬ 
tion  distributions  in  the  C02  photoproduct  resulting  from 
the  193  nm  laser  photolysis  of  pyruvic  acid  CH3COCOOH. 
Approximately  97%  of  the  C02  product  is  formed  directly 
in  the  vibrational  ground  state,  while  the  remaining  C02 
molecules  are  produced  with  a  significant  degree  of  vibra¬ 
tional  excitation.  These  latter  molecules  exhibit  vibrational 
energy  distributions  which  correspond  to  the  following 
mode  temperatures: 

r(v2)  =  1800  ±  150  K, 

r(v,)  =  3700  ±  1000  K, 

r(v2  +  v3)  =  2000  ±  400  K. 


These  results  suggest  that  the  193  nm  photolysis  may  pro¬ 
ceed  through  two  dissociation  channels  having  markedly 
different  fragmentation  mechanisms.  In  addition,  evidence 
of  substantial  rotational  excitation  has  been  observed. 

The  vibrational  excitation  in  the  C02  photofragment 
accounts  for  only  a  small  fraction  of  the  energy  available  in 
the  dissociation  process.  The  remaining  6  eV  of  available 
energy  is  sufficient  for  the  production  of  electronically  excit¬ 
ed  acetaldehyde  or  C02  products  as  well  as  the  further  frag¬ 
mentation  of  the  acetaldehyde  molecule.  Preliminary  results 
indicate  that  CO  molecules  are  also  produced  during  the  1 93 
nm  photodissociation.  A  more  complete  picture  of  the  pho¬ 
tofragmentation  dynamics  should  emerge  with  a  wavelength 
dependent  analysis  of  the  dissociation  products. 

These  experiments  demonstrate  the  power  of  time  do¬ 
main  diode  laser  absorption  spectroscopy  for  the  measure¬ 
ment  of  nascent  vibrational  state  distributions  in  polyatomic 
molecules.  Recent  experiments  at  lower  pressures  have  dem¬ 
onstrated  the  usefulness  of  this  technique  in  extracting  rota- 
tionally  resolved  product  state  distributions  for  a  variety  of 
experiments.28  We  are  presently  applying  these  methods  to 
examine  the  nascent  rotational  distributions  in  the  C02  pho¬ 
tofragment  from  pyruvic  acid  in  order  to  futher  elucidate  the 
dynamics  of  the  1 93  nm  photolysis.  The  extremely  high  reso¬ 
lution  of  the  diode  laser  (  —  0.0003  cm'1)  can  also  be  used  to 
measure  the  linewidth  of  the  nascent  product  states  in  order 
to  obtain  information  about  the  recoil  velocity  of  the  CO, 
fragment. 
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The  study  of  collisional  energy  tr:  •  sfer  between  transla¬ 
tional  and  internal  degrees  of  freedom  has  received  increased 
attention  in  recent  years.  An  efficient  source  of  transla¬ 
tional  energy  is  the  excimer  laser  photolysis  of  small  mole¬ 
cules  to  produce  hot  atoms  with  energies  of  several  eV.  In  the 
present  work,  hot  deuterium  atoms  have  been  produced  by 
photolysis  of  D;S: 

D;S  -i-  hv(  193  nm)  —  D*(£tr  =  2. 16 eV)  +  DS  .  ( 1 ) 
Inelastic  scattering  then  produces  excited  C02: 

D*  +  CO,  (00°0V)  —  D  -i-  C02  (00°1,/').  (2) 

High-resolution  diode  laser  spectroscopy  is  then  used  to 
probe  the  rotational  level  of  the  CO-  (00°1 )  state: 

CO,  <00°l.y')  +  Av(4.3/zm)-CO,  (00°2,/'  -  1)  .  (3) 

In  this  Communication  we  report  the  observation  of  a  pre¬ 
ference  for  excitation  of  high  J  relative  to  low  J  levels  of  CO, 
(00u! )  by  hot  D  atoms  compared  to  hot  H  atoms. 

This  experiment  is  based  on  an  earlier  study  of  the  rota¬ 
tional  distribution  of  CO,  (00°1,/)  due  to  collisions  with 
hot  H  atoms  at  2.30  eV  produced  by  the  photolysis  of  H-S.9 
In  that  study,  the  rotational  distribution  was  found  to  be 
peaked  at  J  =  33,  substantially  higher  than  the  room-tem¬ 
perature  Boltzman  distribution  peak  of  /=  15.  In  work 
completed  prior  to  that  study,7  8  the  total  number  of  quan¬ 


tum  in  the  CO,  asymmetric  stretch  and  bend  vibrations  was 
measured,  and  the  data  indicated  a  substantial  isotope  effect 
with  the  heavier  D*  atom  exciting  the  lower  frequency  bend 
preferentially. 

In  the  present  experiments.  193  nm  pulses  from  an  ArF 
excimer  laser  with  unstable  resonator  optics  ( Lambda-Phy- 
sik  EMG  201 )  are  propagated  down  a  9  ft  sample  cell 
through  which  flows  a  1/1  mixture  of  D.S/CO:  or  H,S/ 
CO,.  Tunable  cw  radiation  from  a  diode  laser  ( Laser  Ana¬ 
lytics)  at  4.3  fi m  is  propagated  through  the  cell  collinearly 
with  the  excimer  beam.  The  diode  laser  is  tuned  to  specific 
rotational  lines  of  the  00°1  —  00°2  vibrational  band  of  CO,. 
Time-resolved  changes  in  the  transmitted  IR  intensity  are 
measured  with  an  InSb  detector.  The  signals  are  digitized, 
summed,  and  sent  to  a  computer  for  storage  and  later  analy¬ 
sis. 

Transient  absorption  signals  of  the  00*’ 1  —  00°2  /*(  13). 
/*(  19).  P( 29),  P( 35).  P( 39).  />(4I),  /*(5I ).  P{ 59),  and 
P{  65 )  transitions  in  CO:  were  obtained  at  sample  pressures 
of  25  mTorr.  The  low  pressures  used  assure  that  rotational 
relaxation  of  the  CO,  (  —  4/zs)  is  substantially  slower  than 
the  rise  time  of  the  detection  system  (  —0.5 /zs),  permitting 
direct  measurement  of  nascent  rotational  populations.  The 
observed  signals  for  high  rotational  levels  of00°l  CO,  have 
an  initial  fast  rise  due  to  direct  formation  of  excited  CO,  by 
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RATIO  OF  C02(00*l,J)  EXCITATION  BY  H* vs.  D* 

MjS/COj-I/I;  25mtorr 


FIG  I  Experimental  values  for  NH./ND.  vs  CO.  rotational  level  iJ)  for 
1/1  mixtures  of  H.S/CO-  and  D-S/CO.  at  25  mTorr  N„.  (  ND.  is  the 
population  increase  in  a  given  00"!  rotational  state  after  the  H*(  D*  I/CO. 
collisional  interaction  The  solid  curve  is  a  linear  least-squares  fit  of  the 
data.  The  inset  scope  trace  shows  the  change  in  absorption  of  the  diode 
probe  beam  for  the  00"!  -00°2  Ft  39)  transition  after  193  nm  photolysis  of 
H.S  in  CO.  and  D.S  in  CO.  in  a  9  ft  cell 


hot  H  or  D  atoms  ( see  inset  of  Fig.  1 ) ,  followed  by  a  slower 
decay  which  is  attributed  to  rotational  relaxation.  For  low 
rotational  levels  of  00°  1  CO;.  the  fast  rise  is  small  or  nonexis¬ 
tent.  and  the  signals  have  a  slow  rise  due  to  rotational  relaxa¬ 
tion  which  fills  the  lower  J  states. 

To  obtain  the  rotational  distribution  directly,  the  fast 
rise  amplitudes  must  be  normalized  for  excimer  and  diode 
laser  intensities  and  absorption  line  strengths;  this  was  done 
in  the  earlier  H*  atom  experiment.9  In  the  present  experi¬ 
ments,  to  avoid  the  errors  involved  with  such  normaliza¬ 


tions,  ratios  of  H*  and  D*  atom  fast  nse  amplitudes  were 
measured  by  locking  the  diode  Laser  on  a  given 
00°  1 ,  J '  —  QfflJ  '  —  1  transition  and  flowing  through  the  gas 
cell  an  H2S/C02  sample  and  then  a  D:S/CO;  sample.  When 
plotted  as  a  function  of  rotational  level,  these  ratios  show  a 
significant  negative  slope  (see  Fig.  1 ).  From  the  ratios  and 
the  previously  measured  H*  atom  distribution,9  the  D*  atom 
distribution  can  be  obtained.  While  the  overall  vibrational 
excitation  of  C02  (00°1,./)  by  H*  atoms  is  about  twice  as 
efficient  as  excitation  by  D*  atoms.  Fig.  1  shows  a  clear  pre¬ 
ference  for  higher  J  by  deuterium  as  compared  to  hydrogen. 

The  available  angular  momentum  in  a  hot  H  or  D  colli¬ 
sion  is  L  =  fibg  where  fx  is  the  reduced  mass,  b  the  impact 
parameter,  and  g  the  relative  speed.  For  fixed  b,  DVCO. 
collisions  have  a  v7  larger  L  than  H*/CO:  collisions.  This 
probably  accounts  for  the  increasing  excitation  probability 
of  high  J  00°  1  levels  by  D*  atoms  compared  to  H*  atoms 
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Time  domain  absorption  spectroscopy  with  a  tunable  diode  laser  has  been  used  to  measure  the  rate  constant  for  vibra- 
tion-to-vibration  energy  transfer  m  COj  via  the  near-resonant  “collisional  up-pumping'  process  CO;(00°1  I  +  CO:iOOcl  ) 
*  CO:(00°2)  +  C0;(00°0)  The  diode  laser,  in  combination  with  a  CO;  discharge  reference  cell,  is  used  to  monnoi  the 
time-dependent  population  increase  within  the  CO;(00°2)  state  in  a  10  1  mixture  of  Ar  and  CO;  following  pulsed  excitation 
of  the  CO:(00°! )  level  by  a  Q-switched  CO:  laser  The  intramode  relaxation  rate  constant,  k.  lor  this  process  is  found  to 
be  ( 5  6  ±  0.7)  X  10*  s'1  Torr"1  or  (I .7  ±  0  2)  x  10'10  cm1  s'1  molecule at  298  K 


Introduction 

The  dynamics  of  both  pulsed  and  CW  CO-  lasers  have  been 
the  subject  of  extensive  research  for  many  years  The  behavior 
of  these  systems  is  now  rather  well  understood,  and  kinetic  models 
agree  well  with  experimental  results  Such  models  are  based  upon 
numerous  rate  constants  for  vibration-to-vibration  (V-V)  and 
vibration-to-translation  (V-T)  energy  transfer  between  CO;,  N-. 
and  rare  gases,  values  for  which  have  been  measured  by  infrared 
fluorescence  and  CO-  laser  double-resonance  techniques.-’10 
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Perhaps  the  most  fundamental  V-V  energy-transfer  process  in 
C02  is  the  near-resonant  “collisional  up-pumping"  process 

CO;(00(,I  I  +  CO;(00°1 )  z=7  CO-(00°2l  + 

CO:(00°0)  2i<  *  -25  cm-  ( I  I 

which  serves  as  a  model  for  how  vibrational  energy  is  distributed 
within  the  asymmetric  stretch  U'j)  mode  of  CO.  The  intramode 
relaxation  rate  constant,  k.  for  this  process  is  expected  to  be 
approximately  gas  kinetic  and  has  been  calculated  by  Pack.11  but 
an  experimental  value  for  the  room-temperature  rate  constant  has 
not  been  published  to  date  10  We  have  recently  measured  this 
rate  using  the  powerful  combination  of  a  tunable  diode  laser 
(TDL)  spectrometer  and  a  CO;  discharge  reference  cell  The 
diode  laser  provides  a  high-resolution  probe  I  ~0  0007  cm1)  for 
time  domain  absorption  spectroscopy  and  has  been  recently  used 
to  probe  CO;  laser  dynamics  ’  The  CO-  discharge  reference  cell 


181  Finn.  J  .  Moore.  C  B  J  Chem  Phvs  t*73.  63.  225* 

1 91  Burak.  I  .  Noter.  Y  .  Sroke.  A  IEEE  J  Quantum  Electron  1973.  v 
54  I 

1 10)  Thomason.  M  D  Ph  D  Thesis.  I  niversity  of  V  irgmu.  L  os  Alamos 
National  Laboratory.  L A9420-T  1952 

til)  Pack.  R  T  J  Chem  Phvs  1*88.  *7.  M40  Note  that  Pack  s  rate 
constant,  k.  written  in  terms  of  the  disappearance  of  CO.tfiO0t  t,  is  twice  as 
large  as  that  defined  in  eq  I  and  2 
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Letters 


COLLlSlONAL  up-pumping 
CO,  (00*1 )  +  CO,  (00*1 )  5±  C0,(00*2)  ♦C0J(00*0) 


DIODE  LASER  ABSORPTION 

C0,(00*2;j*  20)  +  ,4.3^m) - *00,(00*3;  J  ■  21  ) 

Ftfiarv  1.  Time  domain  absorption  signal  Tor  (he  (00°2)  —  (00°3)  R{ 20> 
transition  of  CO;  following  excitation  of  the  (00°  I )  level  by  9  6-<im 
radiation  from  a  Q-switched  CO;  laser  The  sample  is  a  1.10  mixture 
of  CO;  Ar  at  a  total  pressure  of  I  0  Torr  and  a  temperature  of  298  K 
The  trace  represents  the  average  of  32000  signals  and  has  a  time  base 
of  20  us  full  scale 

creates  steady-state  populations  of  vibrationallv  hot  CO;  and 
provides  frequency  references  for  thousands  of  high-lying  rovib- 
raliona!  lines  which  are  normally  inaccessible  at  room  temperature 
The  TDL /discharge  cell  combination  allows  virtually  every  im¬ 
portant  rovibrational  level  in  the  CO;  laser  to  be  probed  with 
extremely  high  temporal  end  spectral  resolution 

Experimental  Section 

In  the  present  double- resonance  experiment,  the  9  6-um  output 
of  a  Q-switched  CO-  laser  is  propagated  through  a  2-m-long 
sample  cell  containing  a  1:10  mixture  of  CO-  and  argon,  popu¬ 
lating  the  C0-(00°1 )  vibrational  state.  The  argon  promotes  ro¬ 
tational  relaxation  within  each  CO;  vibrational  level  without 
significantly  affecting  vibrational  relaxation,  so  that  the  rotational 
levels  are  described  by  a  Boltzmann  distribution  throughout  the 
experiment.  In  addition,  the  rare  gas  limits  radial  diffusion  of 
excited  CO-  molecules  out  of  the  probe  beam  The  diode  laser, 
which  is  used  to  monitor  (via  absorption  at  4  3  um)  the  time- 
dependent  populations  in  the  (00°2)  level,  is  copropagated  along 
the  cell  axis,  passed  through  a  monochromator  (to  discriminate 
against  competing  spatial  and  longitudinal  modes),  and  detected 
with  a  cooled  (77  K)  InSb  detecor.  Time-resolved  changes  in 
the  transmitted  intensity  of  the  diode  radiation  are  acquired  with 
a  Biomation  8100  transient  digitizer  and  averaged  on  a  Nicolet 
1 1 70  signal  averager  A  trigger  is  provided  by  detecting  a  portion 
of  the  CO;  laser  pulse  (— 0  4-ws  width)  with  a  cooled  (77  K) 
HgCdTe  detector  having  a  response  time  of  —0.1  «s.  A  typical 
time  domain  absorption  signal  for  the  (00°2)  level  is  shown  in 
Figure  1 ,  where  the  diode  laser  is  tuned  to  the  ( 00°2 )  —  (00°3) 
R(  20)  transition  at  2313  959  cm'1  In  order  to  locate  this  ab¬ 
sorption  line  and  lock  the  diode  laser  to  this  frequency,  a  portion 
of  the  beam  is  split  off  before  the  sample  cell  and  directed  through 
a  CO-  discharge  reference  cell  and  into  a  monochromator/  IR 
detector  The  reference  cell  absorption  signal  is  sent  to  a  lock-in 
amplifier  whose  output  is  fed  back  into  the  diode  laser  current 
controller  for  frequency  stabilization  This  configuration  fixes 
the  frequency  of  the  diode  laser  at  the  peak  of  a  single  absorption 
line  throughout  the  duration  of  the  experiment 

The  discharge  reference  cell  consists  of  a  high-voltage  dc 
discharge  (16  kV;  25  mA)  applied  to  a  low-pressure  mixture  of 
CO,.  N-,  and  He  It  provides  a  steady-state.  non-Boltzmann 
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Hfvt  2.  Typical  CO;  discharge  cell  absorption  spectra  in  the  2300-cm 
region  with  the  discharge  (a  I  turned  off  and  I  bl  ignited  i  25  mA  i  Sole 
that  the  intensity  of  the  (00°2)  —  (00“3 1  Ri  1 4 1  line  ai  23 10  035  cm 
is  increased  by  8  orders  of  magnitude  when  the  discharge  is  turned  on 

population  of  highly  vibrationally  excited  CO;,  with  "effective" 
(Treanor-type1  J)  vibrational  mode  temperatures  as  high  as  2900 
K.'!  In  this  experiment,  the  discharge  enhances  the  (00°3i 
population  by  more  than  8  orders  of  magnitude  (from  —  I0'10  lo 
0.05  of  the  ground-state  population),  providing  a  precise  frequency 
reference  for  each  (00°2)  —  (00°3)  absorption  line  Accurate 
assignments  of  the  additional  lines  present  in  the  discharge  are 
made  possible  by  high-resolution  FTIR  spectroscopy  of  discharge 
excited  CO-  14  Typical  reference  cell  absorption  spectra  are  shown 
in  Figure  2.  Note  the  dramatic  increase  of  available  absorption 
fines  when  the  discharge  is  emp/oved 

Data  Aaalysis 

The  kinetics  of  the  collisional  up-pumping  process  given  in  eq 
I  may  be  modeled  simply  by 

S,(t)  -  *((V,(r)]:  -  Ic.iA'oA'jd)  (2) 

where  ,V„( t )  is  the  population  in  (00 °n)  and  k  and  k.t  are  the 
forward  and  backward  rate  constants.  Since  ,V0  and  .V,(r)  are 
not  significantly  perturbed  by  the  up-pumping  process,  eq  2  is 
uncoupled  from  the  analogous  equations  for  ,V0  and  .V^r).  As 
a  result,  the  general  solution  to  eq  2  may  be  written 

2V,(f)  -  k  r,[2V,(r)]I<-  v’>*  •*'  '*  dr  (3) 

As  a  first  approximation,  we  assume  that  the  (00°1 )  population 
has  no  significant  time  dependence  on  the  time  scale  of  interest, 
l,  .V0A.i.  and  so  ,V,(r)  mav  be  approximated  by  using  a  step 
function  with  amplitude.  .V,  Assuming  .V;(r<0)  ---  0.  the  solution 
to  eq  3  is  given  by 

.V.(r)  »  ,V,(-)[l  -  e  'J  (4i 

where  ,V:t-)  «  (.V, ) 2*/.V0/c_ , 

Time  domain  absorption  signals  were  taken  on  the  (0O°2)  Rl ! 4 1 
and  R ( 20 )  lines  at  total  pressures  ranging  from  0  3  to  10  Torr 
(i  e  .  CO-  partial  pressures  of  27  3-909  mTorr)  The  signals  were 
fit  to  the  right-hand  side  of  eq  4  multiplied  by  a  term  e^'  to 
account  for  the  slow  (  —54  ms'1  Torr  ref  4)  relaxation  of  00°1 
3nd  diffusion  of  molecules  out  of  the  diode  beam  Fitted  values 


1 1 2)  Tremor.  C  E  ,  Rich.  J  SV  .  Rebm.  R  G  J  them  Phi;  |9'T0 
)9hb 

1 1 3)  Dang.  C  .  RcuJ.  J  .  Garxide.  B  K  hpp/Phii  19*2  B."  145  Bulls 
D  .  Rossctu.  C  ,  GueUchvih.  G  Chtm  Phn  19*5  /On.  101 

(I4|  Baiily.  D  .  Fxrrenq.  R  .  Rossem.  C  J  Mo!  Sprctmu  197*  '0  i  ya 
Bailly.  D  .  Farrenq.  R  .  Guelachvih.  G  .  Rossetti.  C  J  Mol  Spmrosc  19*1 
90.  14  Ba illy.  D  3  eme  Cycle  Thesis.  I  Diversity  of  Psris-Sud.  1970 
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Fifur*  3.  Graphs  of  signal  rise  rate.  A. , ,V0.  versus  CO:  partial  pressure 
for  la)  a  simple  exponential  solution  .V.(i)  given  in  eq  4  and  (b)  an 
induction  period  solution  .V,**- (r)  given  in  eq  5  Fitted  slopes  for  these 
graphs  are  given  by  la]  k. ■  ■  (3  3  ±  0  8)  X  10*  s'1  Torr"1  and  (b)  k 
“  I  5  0  x  0  ft  I  x  lOG  Torr  1  Only  solid  data  points  were  used  in  the 
linear  fit  of  the  slope.  A.,.  and  in  Ibl  the  intercept  was  constrained  to  be 
equal  to  zero 

for  A.,.V0  are  graphed  as  a  function  of  pressure  in  Figure  3a.  and 
linear  behavior  is  observed  up  to  CO;  parrtial  pressures  of  450 
mTorr  The  slope  of  these  data  points  is  given  by  A.,  =  (3  3  ± 
0  8)  x  10*  s  1  Torr'1.  i.e  ,  a  forward  rale  for  eq  I  of  A  =  (3  7  ± 
0  9)  x  10*  s  ’  Torr1  The  observed  rates  roll  off  at  a  maximum 
of  1400  ms"1,  corresponding  to  the  delector/amplifier  response 
time  of  '•0  7  us 

Typical  (00°2)  time  domain  absorption  signals  (see  for  example 
Figure  1 1  exhibit  a  small  induction  penod  following  the  CO;  laser 
pulse,  suggesting  the  presence  of  a  secondary  process  which  occurs 
on  the  same  time  scale  of  the  signal  of  interest  We  believe  that 
this  feature  may  be  a  tributed  to  the  finite  response  (  ~0  7  qs) 
of  the  detection  system  In  order  to  improve  upon  the  simple  data 
analysis  given  above,  the  observed  signal,  .V;°**a(rl.  is  a  modeled 
by  using  a  standard  first-order  differential  equation  to  describe 
the  detector  amplifier  response,  driven  bv  the  simple  exponential 
solution  for  V,(rl  given  in  eq  4  The  observed  time  domain  signal 
is  then  given  by 

*.■*-(,)  .  .V,(/)jl  -  -  -  —  [A  .»-*  -  «e"‘  v*]j  (5) 

where  b  is  a  free  parameter  representing  the  response  time  of  the 
detection  system  The  measured  (00°?)  signals  were  fitted  to  the 
right-hand  side  of  eq  5.  multiplied  by  a  term  e  "  as  before  As 
shown  in  Figure  4,  exhibits  an  induction  period  similar 

to  that  seen  in  the  data  Values  for  A.,.V0  obtained  from  this 
analysis  are  graphed  as  functions  of  CO;  partial  pressure  in  Figure 


TIME  I  |UK  ) 

Figure  4.  Time  domain  absorption  signal  for  the  |00°2)  —  I00°3)  R 1 20 1 
transition  for  a  1:10  mixture  of  CO-  and  Ar  at  a  total  pressure  of  I  0  T orr 
and  a  temperature  of  298  K  The  best  fit  to  this  data  using  eq  5  is  shown 
by  the  smooth  line 

3b.  When  fitted  to  a  straight  line  with  zero  intercept,  the  fitted 
slope  is  A.,  »  (5.0  ±  0  6)  x  10*  s"1  Torr"1,  i.e..  a  forward  rate 
of  A  “  (5.6  ±  0  7)  X  10*  s"1  Torr"1.  The  fitted  value  for  6  is  fairly 
constant  over  a  large  pressure  range,  having  an  average  of  15 
ms"1  (a  response  lime  of  0.70  ms),  exactly  the  value  observed  for 
the  roll  off  of  the  rates  plotted  in  Figure  3a 

In  the  fitting  model  employed  above,  the  <  00°  1 )  population  is 
treated  as  a  step  function,  rising  to  us  maximum  value.  .V,.  im¬ 
mediately  upon  firing  the  CO;  laser  An  alternative  explanation 
of  the  observed  induction  period  may  be  the  presence  of  significant 
time  dependence  in  the  (00°l )  population  on  the  time  scale  of  our 
measurement.  Processes  which  might  be  responsible  for  such  a 
time  dependence  include  ( 1 )  incomplete  rotational  equilibration 
within  the  (00°l )  level,  (2)  the  finite  rise  time  of  the  CO:  laser 
(measured  pulse  width  ~0  4-ms  fwhm).  and  (3)  depletion  ( • —  8*3F ) 
of  the  (00°l )  level  by  the  fast  process 

CO;(00°1)  +  COj(Ol'O)  — 

CO-(OlM)  4-  CO.(00°0i  ,i«  -  -13  cm  (6) 

which  occurs  at  a  rate  of  (5.3  ±  I)  x  ID*  s"1  Torr"'  t ref  9) 
However,  additional  modeling  of  these  processes  yields  essentially 
the  same  value  for  A  given  above,  suggesting  that  eq  5  is  adequate 
for  extracting  A  from  the  data 

Discussion 

reproduces  the  induction  period  seen  in  the  observed 
time  domain  (00°2)  signals,  and  as  a  result,  the  fitted  rise  times 
are  faster  than  those  obtained  with  the  simple  exponential  solution 
for  .V.(r)  given  in  eq  4.  The  model  yields  a  correct  value  for  the 
time  response  of  the  detection  system  (as  determined  in  Figure 
3a)  and  provides  a  small  correction  to  the  value  for  the  rale 
constant  obtained  via  eq  4  Our  best  estimate  for  the  collisional 
up-pumping  rate.  A.  is  thus  (5.6  ±0.7)  x  10*  s'1  Torr'1  or  ( 1  7 
±  0.2)  x  10  10  cm1  s'1  molecule"2  at  298  K  This  value  is  quite 
close  to  both  previous  estimates10  '1  and  rates  for  similar  near¬ 
resonant  processes The  distorted  wave  Born  alculalions  of 
Pack"  yield  a  value  of  A  *  9,2  x  10*  s"1  Torr"1,  but  he  suggests 
that  the  actual  rate  is  only  0.75  of  this  value,  i.e..  6  9  x  10*  s  : 
Torr  1  In  addition.  Thomason  gives  a  value  of  (7  61  ±  0  24)  x 
10*  s'1  Torr  1  at  700  K.  measured  by  double-resonance  using  a 
sequence  band  CO:  laser.10 

In  summary,  we  have  measured  the  rate  for  collisional  up- 
pumping  in  CO;  using  the  powerful  combination  of  a  tunable  diode 
laser  spectrometer  and  CO;  discharge  reference  cell  This  ap¬ 
paratus  provides  a  high-resolution  absorption  probe  capable  of 
measuring  CO.  populations  in  rovibrational  levels  as  high  as  3 
eV  in  energy  We  are  presently  using  this  TDL  discharge  cell 
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combination  to  measure  the  nascent  rotational  and  vibrational 
distributions  of  excited  C02  molecules  created  by  a  variety  of 
chemically  interesting  processes  such  as  photodissociation,1  s 
electronic  quenching,  14  inelastic  scattering, 17  and  bimolecular  and 


(15)  O'Neill.  J.  A.:  ICreutz.  T.  G  ,  Flynn.  G.  W„  accepted  for  publication 
in  J.  Chem.  Phys.  Wood,  C.  F.;  O'Neill,  J.  A.;  Flynn.  G.  W.  Chem.  Phys  Leu. 
1984.  109.  317 

(16)  Brady.  B  B.;  Spector.  G  B.;  Chia,  L.;  Flynn.  G  W.  J  Chem  Phys 
1987.  88.  3245. 

(17)  O'Neill.  J  A..  Wang.  C.  X..  Cai.  J.  Y  ;  Flynn,  G.  W.;  Weston,  Jr., 
R  E.  J  Chem  Phys  1986.  85.  4195  O'Neill.  J  A  ;  Cai.  J  Y  .  Flynn.  G 
W  .  Weston,  Jr  .  R  E.  J  Chem  Phys  1986,  84.  50.  Chu.  J  O  i  Wood.  C 
F  .  Flynn.  G  W  .  Weston.  Jr  .  R.  E  J.  Chem  Phys  1984.  80.  1703;  1984. 
81 .  5533.  Hewitt,  S.  A.;  Hershberger,  J.  F.;  Flvnn.  G.  W.,  Weston,  Jr.,  R. 
E  J  Chem  Phys.  1987,  87.  1894 


surface  chemical  reactions.11 
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Direct  formation  of  dielectric  thin  films  on 
silicon  by  low  energy  ion  beam  bombardment 

S  S  Todorov,  C  F  Yu  and  E  R  Fossum,  Department  of  Electrical  Engineering  and  Columbia  Radiation 
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Dielectric  films  with  thickness  of  the  order  of  50  A  ate  obtained  at  room  temperature  by  bombarding  exposed 
silicon  surfaces  with  an  oxygen-containing  ion  beam  of  energy  60  eV.  Silicon  nitride  thin  films  have  also  been 
formed  using  the  same  technique.  The  film  thickness  is  self -limited  and  largely  independent  of  ion  dose  AES 
and  XPS  analysis  of  the  produced  thin  films  indicate  that  the  films  are  not  entirely  stoichiometric  and  contain 
lower  oxidation  states  of  silicon.  The  high  electrical  quality  of  the  oxide  films  is  demonstrated  by  the  successful 
fabrication  of  n- channel  MOS  transistors  with  gate  dielectrics  obtained  by  ion  beam  oxidation  at  room 
temperature. 


Introduction 

Silicon  dioxide  is  an  indispensable  materia!  lor  silicon  MOS 
technology  where  its  excellent  dielectric  properties  are  used  both 
for  insulation  and  field  enhancement  in  the  finished  devices  and  at 
various  stages  of  the  fabrication  process.  Oxides  for  such 
applications  are  usually  grown  on  the  surfaces  of  silicon  wafers  in 
oxidizing  ambients  at  temperatures  exceeding  900  C.  In  some 
applications,  such  as  thin-film  FETs  on  low-melting  point 
substrates,  e  g  for  flat  panel  displays,  an  alternative  to  thermal 
oxidation  must  be  found  Reduced-temperature  oxidation  is  also 
desirable  in  conventional  silicon  technology  since  repeated 
heating  and  cooling  of  the  substrate  leads  to  thermal  stresses, 
crystal  defects,  wafer  warpage  and  impurity  redistribution  These 
become  increasingly  important  with  the  trend  toward  smaller 
device  dimensions  and  the  efforts  to  fabricate  stacked  three- 
dimensional  MOS  structures. 

Various  techniques  have  been  proposed  for  oxidation  of  silicon 
at  reduced  temperatures,  including  plasma  oxidation  and  anodi¬ 
zation.  chemical  vapour  deposition  tCVDl  and  plasma-enhanced 
CVD.  reactive  sputtering,  evaporation  or  sputtering  in  an  oxygen 
ambient,  and  a  number  of  other  methods  However,  few  of  these 
have  successfully  produced  thin  oxides  suitable  for  gate  dielec¬ 
trics.  Recent  reports  of  successful  MOSFET  fabrication  using 
oxidation  at  reduced  temperatures  include  microwave  plasma 
oxidation  at  580  C'.  plasma  oxidation  at  500  C2  and  rf  sputter 
deposition  at  200  C  for  10  h\ 

In  this  paper,  we  report  on  the  use  of  a  broad  low-energy  ion 
beam  for  direct  formation  of  ultra-thin  device-quality  oxide  and 
nitride  films  on  silicon  The  method  has  allowed  the  first 
successful  fabrication  of  FET-quality  thin  oxides  at  room 
temperature  Low  energy  ion  beam  oxidation  has  the  added 
advantage  of  being  a  clean  vacuum  process  with  independent 

Research  supported  by  Joint  Services  Electronics  Program  under  coniraci 
DAAG29-85-K-<1049  and  an  IBM  Faculty  Development  Award 


control  over  the  process  parameters  and  compatibility  with  other 
ion  beam  or  vacuum  techniques 

Broad  low-energy  ion  beams  have  previously  been  applied  to 
the  oxidation  of  metals  for  fabrication  of  superconducting'1  and 
MOM'  structures  and  for  investigation  of  reaction  kinetics  in 
reactive  near-threshold  sputtering6  Focused  ion  beams  have  also 
been  used  to  study  the  oxidation  kinetics  on  1 1 1 1 1  silicon" 

Fabrication 

The  experiments  were  performed  using  a  single-grid  Kaufman- 
type  ion  source,  shown  schematically  in  Figure  1.  The  source 
produces  a  2.5  cm  dia  ion  beam  and  is  capable  of  extracting 
comparatively  large  current  densities  at  low  applied  \  oltages.  The 
extracted  ions  impinge  on  an  electrically  grounded  target 
positioned  15  cm  from  the  source  on  a  copper  substrate  holder 
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Figure  I.  Schematic  diagram  of  low  energy  ion  beam  experimental  set-up 
Base  pressure  3  *  10'  ’  lorr 
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whose  temperature  is  monitored  by  a  surface  mounted  thermo¬ 
couple.  The  target  temperature  is  observed  to  rise  only  several 
degrees  above  room  temperature  during  ion  beam  treatment. 

To  demonstrate  the  high  electrical  quality  of  the  ion  beam 
grown  thin  dielectrics,  MOS  transistors  were  fabricated  using  the 
following  procedure  on  5  cm  dia  (100)  5  Q  cm  p-type  silicon 
wafers,  polished  on  one  side.  The  wafers  are  chemically  cleaned  in 
degreasing  agents.  H,S0*  H,0,  and  10%  HF.  p*  channel  stops 
and  n'  source  and  drain  junctions  are  formed  by  high- 
temperature  (1050;C)  diffusion  of  boron  and  phosphorus,  respec¬ 
tively,  from  spin-on  sources  through  oxide  masks  prepared  by 
oxtdation  in  steam  at  1000;C.  A  5000  A  field  oxide  is  grown  under 
the  same  conditions  and  gate  regions  are  etched  through  to  the 
silicon  surface  using  a  standard  wet  buffered  oxide  etchant  ( BO  E  I. 
Immediately  before  ion  beam  treatment  the  wafers  are  dipped  in  a 
cold  dilute  HF  solution  to  remove  any  residual  native  oxide  from 
the  gate  regions.  After  ion  beam  treatment,  contact  holes  are 
opened  through  the  thin  dielectric  and  3000  A  of  aluminium  is 
evaporated  and  patterned  to  form  the  contact  metallization. 
Then.  3000  A  of  aluminium  is  evaporated  on  the  backside  of  the 
wafer  to  form  a  substrate  contact  Brief  post-metallization 
annealing  is  carried  out  for  3  min  in  N.  at  400  C 

To  investigate  the  thickness  of  the  thin  films,  experiments  were 
also  performed  on  simpler  substrates  on  which  only  MIS 
capacitors  were  fabricated  according  to  the  above  procedure 
except  nop'  and  n  '  regions  were  diffused  and  no  contact  holes 
were  etched.  These  wafers  also  have  large  bare  regions  free  of 
devices  used  to  verify  the  thickness  of  the  films  by  ellipsometry 
measurements 

To  ensure  that  the  deposited  films  are  grown  by  direct  ion 
bombardment  and  are  not  due  to  redeposition  of  material 
sputtered  from  portions  of  the  wafer  covered  by  the  field  oxide,  ion 
beam  oxidation  was  also  carried  out  on  bare  silicon  wafers 
cleaned  and  dipped  in  HF  following  the  same  procedure  The  film 
thickness  of  these  samples  was  measured  on  a  Gaertner  Model 
L 1 1  *  ellipsometer  immediately  after  ion  beam  bombardment  and 
over  a  period  of  several  days  to  ensure  stability  of  the  deposited 
films  These  samples  were  also  characterized  by  Auger  electron 
spectroscopy  and  by  XPS  in  a  Leybold-Heraeus  LHS-10  surface 
analysis  system  equipped  with  a  hemispherical  mirror  analyser 
The  base  pressure  in  the  characterization  chamber  is  4« 
10' 10  torr.  Sputter  profiling  of  the  samples  is  done  at  I  x 
10'  torr  argon 

Results  and  discussion 

The  dependence  of  the  film  thickness  obtained  by  ellipsometry  on 
the  duration  of  ion  beam  treatment  is  shown  in  Figure  2  Values 
for  the  oxide  thickness  were  also  obtained  from  the  accumulation 
capacitance  of  MIS  capacitors  measured  after  post-metallization 
annealing.  The  oxide  thickness  calculated  from  C- 1  measure¬ 
ments  before  annealing  is  unreliable  due  to  senes  resistance  in  the 
evaporated  aluminium-substrate  contacts 

The  ellipsometry  measurements  indicate  that  after  a  rapid 
initial  growth  rate,  the  oxide  thickness  increases  very  slowly  with 
increasing  exposure  times  This  can  be  explained  by  the  very 
limited  diffusion  of  oxygen  atoms  in  the  silicon  at  this  low 
temperature  and  the  inability  of  fresh  oxygen  arrivals  to  reach 
unreacted  silicon  It  should  be  noted  that  the  thicknesses  obtained 
from  the  C- I  measurements  are  all  consistently  slightly  higher 
than  the  values  obtained  by  ellipsometry  This  is  probably  due  to 
amorphization  of  the  thin  film  by  the  bombarding  ions  which 


Figure  1  Oxide  layer  thickness  obtained  by  ellipsometry  as  a  function  of 
ion  beam  exposure  O,  partial  pressure  6.5x10"’ torr  Ion  energy 
60  eV 


would  lead  to  the  films  having  a  different  dielectric  constant  than 
that  of  bulk  SiO,  which  is  assumed  in  the  calculations  of  the 
thickness  based  on  C-V  measurements. 

This  figure  is  a  good  illustration  of  the  self-limiting  behaviour  of 
the  obtained  thin  films  Self-limitation  was  observed  in  the 
measurements  of  the  resistance  of  thin  films  grown  by  direct  ion 
beam  oxidation  of  nickel5  and  for  etch  depths  obtained  by  reactive 
sputter  etching8,  where  the  self-limitation  is  believed  due  to  the 
competing  effects  of  deposition  and  sputtering 

This  dose  independence  has  the  useful  property  of  producing 
very  uniform  oxide  thickness  across  large  targets  The  uniformity 
of  the  oxides  is  shown  in  Figure  3  where  it  is  plotted  against  the 
ion  beam  intensity  profile  The  oxide  thickness  is  obtained  from 
the  accumulation  capacitance  of  2.8  x  |0  ' 3  cm;  MOS  capacitors 
at  various  wafer  sites.  The  ion  beam  intensity  is  measured  by  a 
1. 13  cm  dia  current  probe 

Figure  4  follows  the  evolution  of  the  SilLVVi  and  O  (KLLi 
peaks  in  the  Auger  spectrum  of  an  ion  beam  oxide  and  a  thermal 
oxide  measured  by  ellipsometry  at  47  A  and  48  A.  respectively 
The  thermal  oxide  sample  was  prepared  by  placing  a  cleaned 
silicon  wafer  in  a  pure  dry  oxygen  atmosphere  at  900  C  for  8  min 
AES  is  performed  with  a  3  keV  electron  beam  having  a  1  mm  spot 
size  at  the  target.  Profiling  is  accomplished  by  sputtering  the 


Figure  3.  Comparison  of  oxide  thickness  uniformiiy  across  a  5  cm  Si  wafer 
ifrom  f  I’  measuremems  of  2  43  x  10  1  cm'  MIS  capaciiorsi  and  ion 
beam  curreni  density  profile  at  the  target  Curreni  densuy  probe  diameter 
I  13  cm  O.  partial  pressure  6.5  x  10  ’  torr  Ion  energy  60  eV  Exposure 
lime  6  min  Accelerating  '.oltagc  20  V 
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Figure  4.  Depth  profiles  of  thermal  and  ion  beam  oxides  Plotted  are  the 
peak-to-peak  amplitudes  of  SilLWl  and  OlKLLl  Auger  signals, 
"—thermal  Si:  '  — thermal  O:  ■ — ion  beam  Si:  • — ion  beam  O 

sample  with  a  differentially  pumped  rastertng  3  keV  argon  ion 
source  with  current  density  1.2  qA  cm "  ’.  Based  on  elhpsometry. 
the  sputtering  rate  for  StO,  is  3.5  A  min " l.  The  depth  profile  of 
the  thermal  oxide  indicates  a  linearly  decaying  signal  correspond¬ 
ing  to  uniform  composition  of  the  film.  The  oxygen  profile  for  the 
ion  beam  oxide  has  a  peak  at  a  depth  of  about  two  monolayers 
This,  as  well  as  the  attenuation  of  the  silicon  signal  would  indicate 
higher  concentration  of  impurities  near  the  surface.  The  complete 
surface  Auger  spectrum  contains  a  tungsten  peak  due  to 
sputtering  of  the  source  cathode  and  neutralizer  filaments  The 
shallower  slope  of  the  concentration  profiles  in  the  ton  beam 
sample  compared  to  the  thermal  oxide  would  also  indicate  a 
larger  width  of  the  interface  region  probably  due  to  beam  induced 
disorder 

Angle-resolved  XPS  spectra  of  a  48  A  thermal  and  a  47  A  ion 
beam  oxide  are  shown  in  Figure  5  XPS  is  performed  using  a  non- 
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Figure  5.  Angle-resolved  XPS  spectra  of  t a i  thin  ihermal  oxide  and  ibi  ihm 


monochromatized  Mg-K,  source  providing  1253  6  eV  X-rays  at  a 
power  of  240  W.  The  analyser  is  calibrated  using  the  Au  4P  2 
peak  with  binding  energy  IBE)  83.8  eV  The  calibration  and  the 
shifts  due  to  substrate  charging  are  checked  throughout  the  runs 
by  monitoring  the  C„  peak  I  BE  =  284  5  eV  ).  Both  the  O , ,  and  Si.p 
peaks  were  monitored  with  progressively  shallower  sampling 
depth  as  the  detector  was  oriented  progressively  farther  away 
from  the  surface  normal.  Figure  5<al  shows  the  presence  of  a  sharp 
O,,  peak  with  BE  =  533. 1  eV  at  all  detector  angles  which  is 
evidence  of  the  uniform  stoichiometry  of  the  thermally  grown 
oxide  film.  The  double  Si2p  peak  includes  the  silicon  signal  from 
the  oxide  layer  (BE=!03  9  eV|  and  the  substrate  silicon  signal 
(BE  =  99.4  eV).  The  intensity  of  the  substrate  Si,p  peak  decreases 
as  the  sampling  depth  decreases  and  disappears  at  tf  =  70  The 
O,,  peak  of  the  ion  beam  oxide,  as  shown  in  Figure  5tbi  is  much 
broader  and  comprises  two  unresolved  components  The  higher 
binding  energy  one  (BE  =  532.1  eV)  dominates  at  b  =  0  The 
lower  binding  energy  peak  iBE  =  531.6  eVi  gradually  increases  in 
amplitude  relative  to  the  532.1  eV  peak  with  increasing  detector 
angle  and  dominates  at  tf  =  70  The  behaviour  of  the  lower  BE 
component  correlates  well  with  the  emergence  and  growth  of  a 
similar  lower  BE  component  of  the  oxide  Si,p  at  higher '/  These 
lower  BE  components  suggest  that  the  upper  surface  region 
consists  of  oxide  with  lower  oxidation  states  than  SiO,  due  either 
to  the  preferential  sputtering  of  the  lighter  oxygen  component  or 
to  the  stopping  depth  of  the  impinging  oxygen  ions  The  ion  beam 
sample  Si,p  oxide  peak  has  BE  =102  9  eV  a  typical  value 
observed  for  very  thin  SiO,  l ref  9). 

The  high  electrical  quality  of  the  ion  beam  grown  oxide  films  is 
best  illustrated  by  the  fabrication  of  n-channel  MOS  transistors 
with  gate  dielectrics  formed  by  ion  beam  oxidation  at  room 
temperature.  The  current-voltage  characteristics  of  such  a 
transistor  are  shown  in  Figure  6.  The  gate  has  a  30  urn  length  and 
a  16  |im  width.  Leakage  current  through  the  gate  is  6  nA  at  1  V, 
however,  this  is  several  orders  of  magnitude  lower  than  the  drain- 
source  saturation  current  at  the  voltages  at  which  the  transistor 
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ion  beam  oxide  t*  is  the  angle  between  the  xurtace  normal  and  the  detector 
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Figure  6.  Current-voltage  characteristics  of  a  MOSFET  fabncated  by  ion 
beam  oxidation  following  post-metallization  annealing  horiz. — 0.2  V- 
div:  vert — 20  uA  div.  0  2  V  step  O.  partial  pressure  6.5  x  10  '  torr 
Ion  energy  60  eV  Time  12  min 
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Figure  7.  C-  l  and  /-  (  characteristics  of  low  energy  ion  beam  oxide  i  solid 
curvesi.  nitride  (dashed  curvesi.  and  control  (dotted  curvesi  MIS 
capacitors  60  eV  O.  for  !  2  min  at  6  5  *  It.)  *  torr  xo  eV  \ .  for  6  mm  at 
3.2  »  10'*  torr  Control  sample  a:  same  conditions  as  oxide  sample  but 
with  shutter  closed 


operates  and  does  not  affect  ns  successful  operation.  The 
breakdown  strength  of  the  oxide  is  at  least  '  «  10”  V  cm  ' 1  The 
saturation  mobility  is  calculated  at  575  cm;  [V  si' 1  making  use  of 
the  value  of  the  capacitance  in  accumulation. 

It  has  been  suggested  that  very  thin  silicon  nitride  films  mav  be 


better  dielectrics  than  silicon  dioxide  films  of  comparable 
thickness.  In  fact  silicon  oxynitride  films  have  been  successfully 
produced  by  5  keV  ton  bombardment  of  silicon10.  We  have 
demonstrated  the  ability  to  produce  nitride  films  on  silicon  at  ion 
energies  below  100  eV.  Figure  7  presents  a  comparison  of  the  C-f 
and  I-V  characteristics  of  aluminium  gate  MIS  capacitors 
fabncated  by  low  energy  ion  beams  at  room  temperature.  The 
control  sample  was  fabricated  by  the  same  procedure  except  that 
it  was  not  directly  bombarded  by  the  ions,  since  the  shutter  was 
not  opened.  The  ohmic  nature  of  the  non-bombarded  metal-sili- 
con  contact  is  expected  and  is  evidence  that  not  simply  charged, 
but  energetic  species  are  necessary  for  the  formation  of  the  thin 
films.  The  figure  also  indicates  that  reverse-bias  leakage  current  of 
the  nitnde  sample  is  lower  than  that  of  the  oxide  by  a  factor  of  two 


Summary 

Device-quality  ultra-thin  dtelectnc  films  have  been  produced  on 
silicon  substrates  bombarded  by  reactive  ions  with  energies 
60-80  eV.  The  damage  induced  by  the  ion  beam  does  not 
significantly  affect  device  operation. 
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Interfacia!  As  is  shown  to  reduce  reverse-bias  current  in  Al-GaAs  Schottky  barriers.  It  is 
suggested  that  the  leakage  reduction  is  associated  with  the  removal  of  low  work  function  phases  at 
the  interface.  In  addition,  current-voltage  measurements  performed  on  In*,  75  Gao  23  As-GaAs 
heterojunctions  indicate  a  dependence  upon  the  condition  of  the  GaAs  prior  to  deposition  of  the 
Ino^jGao  j,  As  layer. 


I.  INTRODUCTION 

Excess  reverse  bias  current  in  metal-semiconductor  con¬ 
tacts  can  degrade  device  performance.  For  example,  gate 
current  limits  field-effect  transistor  (FET)  sensitivity  and 
increases  power  consumption.  In  charge-coupled  device  ap¬ 
plications,  gate  current  can  contribute  to  dark  current  and 
introduce  nonlinearity  in  charge  packets,  thus  reducing  the 
signal-to-noise  ratio.  Reverse  bias  current  reduction  in 
GaAs  devices  requires  insight  into  the  relationship  among 
chemical,  structural,  and  electrical  properties  of  the  metal- 
semiconductor  interface.  A  manifestation  of  the  interaction 
among  these  properties  is  the  phenomenon  of  Fermi-level 
pinning. 

Understanding  Fermi-level  pinning,  however,  remains  a 
formidable  problem.  Models  which  attempt  to  describe  this 
behavior  include  metal-induced  gap  states  (MIGS),’  ex¬ 
trinsic  defect  generation  during  adatom  deposition,2  and  ef¬ 
fective  work  functions  of  mixed  interface  phases  resulting 
from  chemical  reactions  between  metal  and  semiconductor.5 
The  effective  work  function  ( EWF)  model  asserts  that  these 
reactions  primarily  result  in  excess  anion  precipitates  (e  g., 
arsenic)  which  dominate  the  interface  behavior  and  thus 
dictate  the  effective  work  function.  If  low  work  function  re¬ 
gions  exist  at  the  unannealed  interface  they  could  reduce  the 
effective  work  function  and  observed  barrier  height,  notice¬ 
ably  increasing  the  reverse  bias  current.  Ohmic  behavior,  for 
example,  has  been  reported  from  a  low-work  function  phase 
of  Au-Ga  that  forms  at  the  periphery  of  an  annealed 
Au-GaAs  contact.4  Reduction  or  elimination  of  these  low- 
work  function  areas  should  reduce  the  reverse  bias  current. 
This  paper  reports  an  attempt  to  reduce  reverse  bias  current 
in  Al-GaAs  Schottky  diodes  by  creating  a  homogeneous  in¬ 
terface  through  deposition  of  an  arsenic  layer  prior  to  metal¬ 
lization.  In  addition,  the  electrical  properties  of  an 
In<j  75  Gao  23  As-GaAs  junction  have  been  examined  and  are 
shown  to  be  dependent  upon  the  condition  of  the  GaAs  sur¬ 
face  prior  to  deposition  of  the  InGaAs  layer. 

II.  Al-GaAs  JUNCTION 

In  this  experiment,  a  molecular  beam  epitaxy  ( MBE )  sys¬ 
tem  is  used  to  grow  an  Al-As-GaAs  “test"  structure  and  an 


Al-GaAs  “control”  structure.  After  initial  preparation,  an 
n  ”  GaAs  sample  was  mounted  onto  a  Mo  block  "sing  In  and 
loaded  into  the  MBE  chamber.  Subsequent  heating  removes 
the  native  oxide  and  helps  form  a  backside  ohmic  contact.  A 
0.5-^im,  2X  1018  cm-5  n-GaAs  buffer  layer  is  grown  fol¬ 
lowed  by  an  additional  0.5-/im  n-GaAs  layer  doped  at 
2  X  1017  cm  ~ 3 .  The  “test”  structure  was  allowed  to  cool  to 
room  temperature  and  then  exposed  to  an  As-  flux  for  a  time 
corresponding  to  the  deposition  of  100  A  of  As,  assuming  a 
unity  sticking  coefficient.  Prior  to  removal  from  the 
chamber,  1800  A  of  A1  was  evaporated  on  both  samples. 
Diodes  were  defined  by  photolithography  and  wet  chemical 
etching.  The  maximum  temperature  during  these  latter  pro¬ 
cedures  was  90  *C  which  should  avoid  any  annealing  effects 
that  could  degrade  the  As  interlayer’s  homogeneity. 

Current-voltage  (/-F)  characteristics  were  measured  at 
room  temperature,  and  the  barrier  height  and  ideality  calcu¬ 
lated  from  the  forward  bias  data  using  a  least-squares  fit.  A 
typical  comparison  between  the  “test”  and  “control”  struc¬ 
tures  is  depicted  in  Fig.  1 ,  indicating  that  the  sample  contain- 
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Fig.  1  Comparison  between  Al-GaAs  and  Al-As-GaAs  /-Fcharactens- 
tics  showing  large  reduction  in  reverse  leakage  current. 
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Table  I.  Ideality  factor,  barrier  height,  and  leakage  current  density  at  1  V 
reverse  bias  for  representative  samples. 


Sample 

Barrier 
height  (eV) 

Ideality 

Current  density 
at  —  1  V  (mA / 
cm2) 

Al/GaAs 

0  65 

1.06 

31.12 

Al/As/GaAs 

0.70 

1.20 

1.10 

1(a) 

065 

1  47 

0.86 

1(b) 

0  60 

1.03 

3.53 

2(a) 

066 

1.01 

0.11 

2(b) 

0  51 

1.07 

70.70 

ing  the  As  interlayer  exhibits  a  25-fold  reduction  of  reverse- 
bias  current.  Table  I  compares  the  barrier  height,  ideality, 
and  current  density  for  the  two  samples.  The  test  structure, 
w  hile  displaying  less  ideal  behavior,  does  in  fact  have  a  typi¬ 
cally  50-60  mV  greater  barrier  height  to  accompany  the  re¬ 
duced  current.  The  difference  in  barrier  heights  has  been 
confirmed  by  internal  photoemission  measurements.  Al¬ 
though  roughly  an  80  mV  barrier  height  enhancement 
would  be  necessary'  to  account  for  the  current  reduction,  the 
measured  barrier  height  difference  is  within  the  experimen¬ 
tal  error.  Reverse  bias  current  reduction,  therefore,  has  been 
achieved  by  generating  a  more  uniformly  pinned  interface. 

Indeed,  the  coincidence  of  As  and  reduced-dark  current 
has  been  previously  observed" .  Ostensibly  contradictory 
data  for  A1  on  arsenic-rich  GaAs  surfaces  also  exists.'1  Wang 
showed  that  increasing  As-surface  coverage  is  associated 
with  a  concomitant  decrease  in  the  Al-barner  height  to 
GaAs.  It  was  also  shown,  however,  that  the  opposite  is  true 
for  A1  on  AlAs  possibly  due  to  the  fact  that  intermixing  of  A! 
and  Ga  is  precluded  in  the  AlAs  case.  It  is  likely  that  A1  and 
Ga  intermixing  does  not  occur  for  the  thick-As  layer  used  in 
this  experiment  and  thus,  the  apparent  contradiction  can  be 
explained. 


It  might  also  be  possible  to  associate  the  reduced  current 
with  a  metal-insulator  semiconductor  (MIS)  model  of  the 
junction.  This  effect  is  improbable  since  the  current  reduc¬ 
tion  is  a  constant  factor  nearly  independent  of  the  applied 
voltage.  Experiments  to  differentiate  between  various  mod¬ 
els  of  the  reverse  current  transport  mechanism  are  in  prog¬ 
ress. 

111.  In„  7g  Ga0  2$  As-GaAs  HETEROJUNCTION 

The  influence  of  the  GaAs-surface  condition  on  the  mea¬ 
sured  InGaAs-GaAs  barrier  height  was  also  investigated. 
MBE  was  used  to  grow  a  0.5-/rm-GaAs  buffer  layer  on  a 
semi-insulating  substrate  followed  by  a  O.S-fim,  SkIO1” 
cm  “ 5  n- GaAs  layer.  At  this  point,  four  different  sample 
preparations  followed: 

1(a)  Approximately  100  A  of  As  was  deposited  at  room 
temperature  between  the  GaAs  and  the  intrinsic  InGaAs, 
the  latter  of  which  was  grow  n  at  200  °C. 

1(b)  Intrinsic  InGaAs  was  grown  on  the  GaAs  at  450  °C, 

2(a)  The  sample  was  removed  from  the  chamber  and  air 
exposed.  It  was  then  reinserted  and  intrinsic  InGaAs  grown 
at  200  ‘C. 

2(b)  This  sample  was  also  air  exposed.  The  InGaAs- 
growth  temperature,  however,  was  350  °C  which  should  de¬ 
sorb  the  As  oxide  at  the  GaAs  surface. 

In  all  samples,  the  intrinsic  InGaAs  layer  is  0.2-^im.  The 
samples  are  schematically  represented  in  Fig.  2.  After  the 
InGaAs  grow  th,  1 80-/im-diam  Au  contacts  were  evaporated 
onto  the  InGaAs  through  a  stainless  steel  mask  in  a  depo¬ 
sition  system  having  a  base  pressure  of  1  >.  10  Torr.  Prob¬ 
ing  between  Au  contacts  before  mesa  etching  the  InGaAs 
layer  yields  ohmic  behavior  indicative  of  a  low  barrier  con¬ 
tact  of  the  Au  to  the  intrinsic  InGaAs.  After  mesa  etching, 
the  I-V  characteristic  resembles  that  of  two  back-to-back 
diodes.  Ohmic  contacts  were  formed  by  alloying  In  to  the 
GaAs  at  300  "C  in  forming  gas  using  a  rapid  thermal  an¬ 
nealer  Before  measuring  the  rectifying  behavior  of  the 
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Fig.  2.  Schematic  cross  section  of  the 
I  no  y,  Gao  2,  As-GaAs  junctions.  Samples 
1(a)  and  1(b)  were  grown  entirely  under 
UHV  conditions  Samples  2(a)  and  2(b) 
were  air-ex  posed  pnor  to  InGaAs  deposition 
Au  was  evaporated  after  removal  from  the 
MBE  chamber 
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Fig.  3  /-F  characteristics  for  In,,  -<  Gx. As-Ga  As  junctions  indicating  a 
strong  dependence  on  the  GaAs  surface  condition 


InGaAs-GaAs  junction,  the  characteristic  between  two  Au 
contacts  as  well  as  the  integrity  of  the  In  contacts  were  con¬ 
firmed 

T ypical  /-  (-'characteristics  for  the  four  samples  are  shown 
in  Fig.  3.  while  the  barrier  heights  and  ideality  factors  are 
summarized  in  Table  I.  The  samples  possessing  interfacial 
As,  whether  elemental  or  in  oxide  form,  exhibit  current  re¬ 
duction.  Specifically,  the  InGaAs-As-GaAs  sample 
has  four  times  less  current  at  1-V-reverse  bias  than  the 
InGaAs-GaAs  case.  More  dramatically,  for  the  air-exposed 
samples  the  one  in  which  the  As  oxide  was  desorbed  has 
roughly  a  factor  of  640  greater  current  than  the  sample 
grown  at  lower  temperature.  In  fact,  the  air-exposed  low 
temperature  growth  sample  has  the  lowest  reverse  bias  cur¬ 
rent  of  all  samples  tested  as  well  as  the  best  ideality  factor. 
Furthermore,  the  barrier  heights  for  the  InGaAs-As-GaAs 


sample  and  the  air-exposed  sample  possessing  the  As  oxide 
are  nearly  equivalent  to  within  the  10-mV  deviation  in  mea¬ 
surement.  The  lowest  barrier  height  is  associated  with  the 
air-exposed  sample  in  which  the  As  oxide  was  desorbed. 

IV.  SUMMARY 

Leakage  current  in  Al-GaAs  Schottky  diodes  has  been 
reduced  by  the  introduction  of  an  interfacial  As  layer.  While 
this  effect  does  not  preclude  other  models  of  the  metal-semi¬ 
conductor  interface,  it  does  support  the  premises  of  the  effec¬ 
tive  work  function  model  in  which  the  barrier  height  reflects 
an  average  work  function  of  mixed  phases  and  As  is  believed 
to  pin  the  interface  Fermi  level.  The  measured  barrier  height 
of  the  In0-,Gao-?  As-GaAs  heterojunction  has  also  been 
shown  to  depend  upon  interfacial  As.  regardless  of  the  As 
phase.  Investigation  of  leakage  current  reduction  not  only 
lends  insight  into  the  interface  but  may  also  have  practical 
implications  for  threshold  voltage  control  and  sidegating  in 
GaAs  integrated  circuits. 
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Abstract 

Bombardment  of  silicon  surfaces  by  low  energy  oxygen  ions  has 
been  investigated  as  a  possible  process  for  growing  films  of 
Si02  at  room  temperature.  Broad  ion  beams  of  energy  40-200  eV 
and  variable  oxygen  content  have  been  used  to  grow  ultra-thin 
oxides  of  extremely  uniform  thickness.  The  ion  beam  oxides  are 
similar  to  thin  thermal  oxides  in  many  respects  -  composition, 
chemical  binding,  optical  and  electrical  properties.  The 
dependence  of  the  thickness  and  quality  of  the  oxide  films  on  ion 
dose,  ion  energy  and  substrate  temperature  have  been 
investigated.  The  obtained  thickness  is  observed  to  vary  only 
slightly  with  increasing  substrate  temperature  up  to  650°C 
which  indicates  non-thermal  process  kinetics.  The  ion-beam 
oxides  reach  a  limiting  thickness  of  40-60  X  which  is  largely 
independent  of  ion  dose  and  is  also  found  to  be  insensitive  to 
ion  energy. 

The  observed  oxidation  rates  are  explained  on  the  basis  of 
radiation-enhanced  diffusion  and  reaction  processes.  Limited 
thicknesses  are  observed  even  when  sputtering  is  negligible 
because  of  the  decreasing  effective  penetration  of  the  ions  due 
to  the  swelling  of  the  target  which  accompanies  the  conversion  of 
Si  to  SiC>2.  Thus  the  film  grows  until  the  oxide-semiconductor 
interface  moves  beyond  the  current  ion  penetration  depth  after 
which  oxidation  effectively  stops.  This  model  is  equally 
applicable  to  high-energy  high-dose  oxygen  ion  implantation  for 
production  of  buried  oxides  in  SOI  technology  where  it  is 
observed  that  oxide  growth  occurs  predominantly  at  the  upper 
interface . 
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I.  INTRODUCTION 


Further  advances  in  the  development  of  high  density 
integrated  circuits  (ICs)  will  require  scaled  down  device 
dimensions,  closer  device  packing  and  a  greater  number  of 
fabrication  steps.  These  in  turn  require  reduced  1C  fabrication 
thermal  budgets  and,  in  the  case  of  metal-oxide-scmiconductor 
(MOS)  devices,  thinner  gate  dielectrics.  The  trend  toward 
larger  substrates  places  an  additional  requirement  on  the  lateral 
uniformity  of  the  gate  oxides.  The  problem  of  obtaining  high 
quality  thin  films  of  Si02  at  reduced  temperatures  has  been 
approached  from  two  directions  -  oxide  growth  and  oxide 
deposition.  These  differ  in  the  type  of  material  being  supplied 
to  the  substrate  -  oxygen  only  or  both  oxygen  and  silicon.  Both 
approaches  rely  on  the  use  of  some  form,  of  energetic  species  to 
enhance  the  low  oxidation  rates  typically  observed  at  reduced 
temperatures.  Successful  fabrication  of  KDS  devices  has  been 
reported  for  gate  dielectrics  grown  by  plasma  oxidation  at 
500-600°c  and  deposited  by  rf  sputtering  at 
200-300°C  3.  Very  thin  SiC>2  films  of  high  electrical 
quality  have  also  been  deposited  by  plasma  enhanced  chemical 
vapor  deposition  (PE-CVD)  at  350°C  *.  MOS  transistors  have 
been  fabricated  using  very  thin  gate  dielectrics  grown  at  25°C 
by  ion  beam,  oxidation 

Due  to  the  spatial  separation  of  the  plasma  producing  region 
and  the  sample,  ion  beam  oxidation  permits  independent  control 
over  ion  energy  and  ion  flux,  substrate  temperature  and  angle  of 

i  incidence.  These  in  turn  permit  detailed  study  of  the  influence 
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of  the  different  process  parameters  on  the  obtained  oxides,  the 
results  of  which  may  be  extended  to  rf  oxidation. 

The  electrical  characteristics  and  other  properties  of  the 
very  thin  oxide  films  grown  at  room  temperature  by  ion  beam 
oxidation  have  been  reported  previously  Briefly,  these 

showed  that  ion  beam  oxides  are  similar  to  thermally  grown  SiC>2 
films  of  equal  thickness  in  many  respects.  They  have  almost 
identical  Auger  sputter  profiles  except  for  the  apparently 
sharper  interface  of  the  thermal  oxide.  X-ray  photoelectron 
spectroscopy  at  glancing  detector  angles  indicated  the  presence 
of  lower  oxides  of  silicon  at  the  film  surface.  The  most  notable 
difference  between  thermal  and  ion  beam  oxides  is  the  2-3  orders 
of  magnitude  greater  leakage  current  in  reverse  bias  for  the 
latter.  However,  this  leakage  was  another  three  orders  of 
magnitude  lower  than  the  drain-source  saturation  current  and  thus 
did  not  degrade  M0SFE7  per f orm.ance  -* .  In  this  paper  we  will 
discuss  the  dependence  of  the  oxide  growth  kinetics  on  the  ion 
bombardment  conditions  and  will  propose  a  mechanism:  describing 
the  oxidation  process. 

II.  EXPERIMENTAL 

The  test  vehicles  are  (100)  silicon  wafers,  5-cm  diameter, 
polished  on  one  side.  Both  p-type  and  n-type  wafers  of 
resistivity  2-5  fi-cm  are  used.  The  samples  are  prepared  by 
chemical  cleaning  and  the  growth  and  stripping  of  two  thick 
900°C  oxides  after  which  they  are  loaded  into  the  ion  beam 
processing  chamber.  Ion  beam  oxidation  is  performed  using  a 


single-grid  broad-beam  ion  source,  described  previously  7 .  The 
source  operates  on  a  mixture  of  argon  and  oxygen.  It  was  found 
that  a  1:1  Ar:02  ratio  allowed  both  good  control  over  source 
operation  for  extended  periods  of  time  and  led  to  the  production 
of  oxide  films  on  the  substrates.  After  unloading  from  the  ion 
beam  processing  chamber  the  thickness  of  the  obtained  films  is 
measured  with  a  Gaertner  L117  ellipsometer  using  a  He-Ne  laser. 
The  thicknesses  measured  by  ellipsometry  correlate  well  with 
those  obtained  by  Auger  sputter  profiling,  calculated  from  the 
ratio  of  the  XPS  Si  2p  peaks  or  calculated  from  the  capacitance- 
voltage  characteristics  of  MOS  capacitors  6.  The  oxide 
thickness  was  calculated  from  the  ellipsometer  measurements  with 
the  standard  a  priori  assumption  of  the  refractive  index 
necessary  for  ultra  thin  films.  The  index  of  refraction  used  is 
that  of  thermal  Si02.  This  is  justified  by  the  aforementioned 
similarity  between  the  two  types  of  oxides,  further  illustrated 
in  Fig.  1  which  compares  the  unprocessed  ellipsometer  parameters 
L  and  V  .  The  solid  line  shows  the  evolution  of  A  and  V  for 
thermal  oxides  of  increasing  thickness  grown  at  875°C.  The 
numbers  represent  ion  beam  oxides  and  correspond  to  the  oxygen 
ion  dose  received  during  the  exposure.  The  ion  beam  current 
density  was  135  uA/cm^  which  corresponds  to  a  flux  rate  of 
5x10^6  o/cm^-min.  It  is  clear  that  the  two  oxides  evolve 
along  essentially  the  same  lines  pointing  further  to  their 
simi larity . 


III.  RESULTS  AND  DISCUSSION 


The  dependence  of  the  obtained  ion  beam  oxide  thickness  on 
oxygen  ion  dose  is  presented  in  Fig.  2.  The  silicon  wafers  were 
bombarded  by  a  60  eV  ion  beam  containing  O2  and  Ar  in  a  1:1 
ratio.  The  substrate  temperature  was  monitored  by  a 
substrate-holder  mounted  thermocouple  and  is  25°C.  The  figure 
shows  that  there  is  an  approximate  logarithmic  dependence  on  the 
oxygen  ion  dose  though  the  scatter  in  the  data  is  more  pronounced 
at  higher  doses.  This  scatter  is  attributed  to  instabilities  in 
the  ion  source  which  become  more  important  at  the  long  exposure 
times  necessary  to  achieve  higher  doses.  It  appears  that  the 
n-type  wafers  oxidize  somewhat  more  slowly  than  the  p-type 
wafers,  however,  the  differences  are  within  the  experimental 
uncertainty.  It  has  been  observed  previously  that  the  ion  beam, 
oxides  exhibit  very  good  thickness  uniformity  over  the  surface  of 
the  wafer  even  though  they  are  grown  by  an  ion  beam  which  has  a 
peaked  gaussian-like  profile  .  This  insensitivity  to  oxygen 
ion  dose  was  attributed  to  the  attaining  of  a  self-limiting  value 
of  the  thickness.  While  it  is  now  clear  that  such  self-limiting 
behavior  is  not  being  observed,  it  is  also  clear  that  the  growth 
rate  at  high  doses  is  sufficiently  low  so  that  even  two-fold 
increases  in  the  ion  dose  would  lead  to  the  growth  of  only  about 
a  monolayer  of  oxide. 

Se 1 f - 1 imi t ing  oxide  growth  has  been  observed  in  the  case  of 
rf  oxidation  of  lead  ®  and  ion  beam  oxidation  of  nickel  9. 

It  is  attributed  to  the  achieved  balance  between  the  competing 
processes  of  oxidation  and  sputtering.  The  oxidation  rate  is 


initially  large  and  decreases  with  increasing  film  thickness 
whereas  the  sputtering  rate  is  independent  of  oxide  thickness. 
Thus  the  oxide  growth  will  stop  at  a  thickness  value  at  which  the 
two  rates  become  equal. 

It  is  expected  that  self-limitation  will  occur  earlier  at 
higher  ion  beam  energies  where  sputtering  is  more  pronounced. 

That  is  indeed  the  case  as  shown  in  Fig.  3.  The  samples  are 
bombarded  with  a  100  eV  ion  beam  with  a  1:1  Oj:  Ar  ratio  at 
25°C.  It  is  seen  that  at  higher  ion  energies  higher  initial 
oxidation  rates  are  observed.  This  is  probably  related  to  the 
greater  depths  at  which  the  ions  are  stopped  and  the  increased 
probability  that  they  will  be  retained  in  the  target.  The 
oxidation  rate  then  decreases  more  quickly  than  for  6C  eV  ions 
and  appears  to  exhibit  self-limiting  behavior. 

To  investigate  the  importance  of  beam,  heating  of  the 
substrate  during  ion  bombardment  several  samples  were  oxidized  at 
different  oxygen  dose  rates  leading  to  the  same  received  dose. 

The  ion  flux  at  the  substrate  was  varied  by  changing  the  source 
discharge  power  or  by  changing  the  source-to  substrate  distance. 
No  differences  in  the  properties  of  the  obtained  oxides  were 
observed . 

The  substrate  may  be  intentionally  heated  up  to  650°C  by 
means  of  resistive  heating  of  the  substrate  holder.  The  obtained 
oxide  thickness  for  fixed  ion  beam,  conditions  is  observed  to 
increase  with  increasing  substrate  temperature.  However,  this 
dependence  is  rather  weak,  especially  considering  the  large 
temperature  range.  Presenting  the  data  in  an  Arrhenius  plot,  as 


shown  in  Fig.  4,  allows  the  extraction  of  an  activation  energy 
Eact  *or  the  process.  Here  the  oxidation  rate  dependence  has 
been  approximated  by  x  =  C  lnt,  where  t  is  the  oxidation  time 
and  C  is  a  constant  which  is  assumed  to  depend  on  the  temperature 
T  in  the  standard  fashion:  C  =  Ccexp( -Eact/kT) ,  where  k  is 
the  Boltzmann  constant.  Note  that  the  data  is  largely 
independent  of  ion  dose.  The  slope  of  the  obtained  straight  line 
allows  the  determination  of  Eact  =  7  meV.  For  comparison  the 
activation  energy  of  the  linear  rate  constant  B/A  defined  in 
thermal  oxidation  of  silicon  is  2  eV  Clearly,  oxidation 

due  to  ion  bombardment  relies  on  non-thermal  mechanisms  of 
enhancing  the  diffusivity  and  reactivity  of  the  oxidizing 
species . 

The  observed  oxide  thicknesses  correlate  well  with  the 
expected  oxygen  ion  projected  range,  accounting  for  the  volume 
expansion  which  accompanies  the  conversion  of  silicon  to  SiC>2. 
Thus,  the  obtained  thickness  rtay  be  expected  to  increase  at 
higher  ion  energies.  The  dependence  of  oxide  thickness  on  ion 
energy  for  a  fixed  oxygen  ion  dose  is  shown  in  Fig.  5  for  beams 
with  a  50%  oxygen  content.  The  lack  of  a  trend  over  the  covered 
energy  range  is  evident.  It  is  difficult  to  imagine  that  the 
increased  oxidation  rate  is  exactly  counteracted  by  an  equal 
increase  in  the  sputtering  yield  unless  both  processes  are 
governed  by  the  same  mechanism. 


y 
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IV.  PROCESS  MODEL 


Previous  models  of  low  energy  ion  beam  oxidation  of  metals 
describe  the  oxidation  process  in  the  absence  of  sputtering  as 
decreasing  exponentially  with  increasing  exposure  time 
However,  this  model  is  carried  over  from  the  process  of  rf 
oxidation  of  metals  in  which  a  self-biasing  of  the  oxide  layer  is 
observed  and  does  not  account  for  the  fact  that  the 

oxidizing  species  are  actually  deposited  below  the  surface  and 
are  distributed  throughout  the  film.  Also  it  does  not  explain 
the  observed  enhancement  in  the  oxidation  rates  over  thermal 
oxidation  processes. 

It  is  considered  appropriate  to  view  low-energy  ion  beam 
oxidation  of  silicon  as  a  scaled-down  version  of  the  high-energy 
high-dose  oxygen  implantation  processes  used  for  obtaining  buried 
layers  of  Si02  in  silicon-on-insulator  (SOI)  technology.  The 
models  developed  to  describe  this  process  introduce  enhanced 
effective  diffusion  constant  and  reaction  rate  and  also  take  into 
account  the  swelling  of  the  target  12,13.  These  models, 
however,  neglect  any  spatial  dependence  of  the  enhancement  and 
lead  to  the  prediction  of  preferential  growth  toward  the 
substrate  12.  jt  ^as  peen  observed  that  the  growth  of  the 
buried  oxide  layer  takes  place  mostly  at  the  top 
interface  14.1^.  The  preferred  oxidation  upwards  has  been 
attributed  to  the  different  fate  of  silicon  interstitials  at  the 
two  Si-SiOj  interfaces  1®.  The  interstitials  which  are 
emitted  during  the  oxidation  process  can  recombine  at  defects  or 
at  the  surfaces  of  the  wafer.  Due  to  the  much  larger  distance  to 


V 


the  back  surface,  the  interstitials  at  the  bottom  interface  are 
practically  fixed  and  serve  to  block  the  oxidation  process  16 . 

We  propose  an  important  modification  to  these  models.  While 
it  is  clear  that  sputtering  and  oxidation  compete  for  domination 
in  these  processes,  the  explanation  of  enhanced  oxidation  in 
unheated  substrates  is  considered  unsatisfactory.  It  is  proposed 
that  the  enhanced  reactivity  and  mobility  of  the  oxidizing 
species  are  due  to  the  ion  bombardment.  Thus,  at  any  time, 
oxidation  is  taking  place  only  at  those  depths  which  are 
currently  being  reached  by  the  ion  beam..  The  swelling  which 
accompanies  the  conversion  of  silicon  to  silicon  dioxide 
effectively  prevents  later  ion  arrivals  from  reaching  the  same 
depths  in  the  silicon.  Moreover,  it  is  difficult  for  them  tc 
diffuse  deeper  into  the  substrate  after  they  are  stopped  since 
their  diffusion  is  enhanced  only  in  the  region  in  which  the  ions 
are  depositing  energy.  In  other  words,  it  is  important  tc 
recognize  that  the  effective  process  rates  are  spatially 
dependent.  Because  of  the  volume  expansion  this  spatial 
dependence  means  that  the  oxidation  rate  at  the  oxide- 
semiconductor  interface  is  time  dependent  and  oxidation  at  the 
lower  interface  will  stop  when  a  sufficient  portion  of  the 
substrate  has  been  converted  to  SiC>2*  Note  that  the  concept  of 
a  temperature  increase  due  to  thermal  spikes  or  beam,  heating  is 
not  necessary  for  this  treatment.  Enhanced  mobility  and 
reactivity  may  be  due  to  other  beam  related  processes  like  the 
creation  of  excess  numbers  of  defects  in  the  vicinity  of  an  atom 
or  the  transfer  of  excess  energy  to  the  atom  in  a  collision 
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The  proposed  model  accounts  for  the  enhanced  oxidation  rates 
observed  for  unheated  substrates  under  ion  bombardment.  Also,  it 
accounts  for  the  insensitivity  to  substrate  temperature  observed 


in  experiments  on  low  energy  ion  beam  oxidation  of  silicon. 
Further,  it  naturally  explains  the  limited  thicknesses  observed 
with  this  technique  even  in  the  absence  of  sputtering.  The  model 
is  simply  extended  to  the  preferred  growth  toward  the  top  surface 
of  the  wafer  observed  in  the  case  of  buried  oxides  produced  by 
high  energy  oxygen  implantation.  A  quantitative  evaluation  of 
the  process  parameters  based  on  this  model  will  be  published 
separately 

V.  CONCLUSION 

In  summary  the  bombardment  of  silicon  surfaces  by  low  energy 
oxygen  ion  beams  is  observed  to  lead  to  the  growth  of  ultra  thin 
films  of  silicon  dioxide.  The  oxide  growth  is  self -limiting  due 
to  the  competing  processes  of  oxidation  and  sputtering.  Limited 
thicknesses  are  also  observed  in  the  absence  of  sputtering.  The 
oxidation  process  is  explained  in  terms  of  beam-enhanced 
reactivity  and  mobility  of  the  oxidizing  species.  The  importance 
of  recognizing  the  spatial  limitations  of  the  enhanced  oxidation 
process  is  stressed.  The  explanation  of  the  process  is  extended 
to  buried  SiC>2  layers  produced  by  oxygen  implantation. 
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FIGURE  CAPTIONS 


Fig.  1.  Comparison  of  the  evolution  of  thin  875°C  thermal 

(solid  line)  and  25°C  ion  beam  (numbers)  oxide  films. 

The  numbers  correspond  to  the  exposure  in  minutes  to  a  60 
eV  ion  beam  of  1:1  Ar:C>2  content  and  an  oxygen  dose 
rate  of  5xl016  cm”2min"l. 

Fig.  2.  Ion  beam  oxide  thickness  as  a  function  of  oxygen  dose. 

Ion  energy:  60  eV.  Beam  oxygen  content:  50  %.  Substrate 
temperature:  25°C.  O  -  p-type  silicon;  C  -  n-type 
silicon . 

Fig.  3.  Ion  beam  oxide  thickness  as  a  function  of  oxygen  dose. 

Ion  energy:  100  eV.  Beam  oxygen  content:  50  %. 

Substrate  temperature:  25°C. 

Fig.  4.  Arrhenius  plot  for  logarithmic  ion  beam,  oxidation  rate 
constant.  Ion  energy:  60  eV.  Beam  oxygen  content: 

50  %.  Q  -  O  dose:  1.3xl017  cm-2.  O  -  0  dose: 
2.7xl017  cm”2. 

Fig.  5.  Ion  beam,  oxide  thickness  as  a  function  of  ion  energy. 

Beam  oxygen  content:  50  %.  Substrate  temperature: 

25°C.  0  -  O  dose:  3.0xl0*7  cm”2.  ^ 

5.3xl017  cm-2. 
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Abstract 


The  sputtering  rate  for  silicon  dioxide  by  argon  ion  bombardment 
at  energies  appropriate  for  ion  beam  deposition  (<  100  eV)  has  been 
measured.  It  has  been  found  that  the  energy  dependence  of  the  oxide 
sputtering  rate  at  these  low  energies  is  easily  predicted  by  assuming 
the  yield  is  limited  by  the  metallic  component  of  the  binary  target. 
This  assumption  is  shown  to  also  predict  the  sputtering  rate  of  other 
metallic  oxides. 
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Bombardment  of  growing  thin  films  by  low  energy  ion  beams  has 
evolved  into  an  important  technique  for  the  modification  and 
deposition  of  thin  films.  Ion  beam  etching  and  reactive  ion  etching 
have  achieved  widespread  application.  More  recently,  the  use  of 
lower  ion  energies  has  led  to  the  development  of  direct  ion  beam 
deposition  applied  to  low-temperature  epitaxial  growth  of  Si  and 
Ge1-3.  Oxygen  and  nitrogen  ion  beams  have  been  used  for  the 
controlled  growth  of  ultra-thin  oxide  and  nitride  films  on  metal  and 
semiconductor  substrates4-7. 


These  techniques  allow  good  control  over  the  composition, 
stoichiometry,  orientation  and  crystallinity  of  the  obtained  films. 
The  basic  factor  which  limits  the  film  growth  rate  in  ion  beam 
deposition  is  the  sputtering  due  to  the  ion  bombardment.  The  growth 
rate  can  be  approximated  by  dx/dt  =  (1/N)  (  H-Yl  ) ,  where  N  is  the 
density  of  the  material,  ^  is  the  sticking  coefficient  or 
incorporation  probability,  Y  is  the  sputtering  yield  and  £  is  the 
ion  flux.  For  deposition  to  take  place  it  is  necessary  that  Y  <  7  ■ 
In  the  case  of  ion  beam  oxidation,  it  has  been  observed  that  a 
limiting  oxide  film  thickness  is  reached  which  is  related  to  the 
sputtering  rate  of  the  material4'7.  it  is  evident  that  sputtering 
plays  an  important  role  in  film  growth  by  direct  ion  bombardment. 

In  this  letter  we  report  experimental  results  of  sputtering  of 
Si02  by  Ar  ions  at  energies  below  100  eV.  Accurate  knowledge  of 
the  sputtering  rate  and  its  energy  dependence  at  these  low  energies 
is  becoming  increasingly  important  in  microelectronics  device 
fabrication  processes.  The  use  of  inert-ion  bombardment  precludes 
film  growth  and  allows  the  separate  investigation  of  the  sputtering 

component  of  the  deposition  process.  It  is  shown  that  a  simple 
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assumption  leads  to  good  agreement  between  existing  theory  for 
non-binary  materials  and  the  reported  measurements,  not  only  for  the 
case  of  silicon  dioxide  but  also  for  other  oxide  targets. 

The  experiments  are  performed  on  samples  prepared  by  oxidizing 
chemically  cleaned  p-type  and  n-type  Si  (100)  wafers  in  dry  oxygen  at 
900°C  for  times  ranging  between  10  min  and  3  hrs  and  resulting 
oxide  thicknesses  of  45  &  to  520  X.  The  thickness  of  the  oxide  films 
is  measured  with  a  He-Ne  laser  ellipsometer  immediately  before 
loading  them  into  the  ion  beam  processing  chamber.  The  samples  are 
mounted  15  cm  from  the  ion  source  in  a  vacuum  chamber  with  a  base 
pressure  of  2xl0-7  torr.  Broad  Ar+  ion  beams  with  energies  in 
the  range  40-100  eV  are  produced  by  a  single  grid  Kaufman  ion 
source.  The  Si02  samples  are  bombarded  with  argon  doses  ranging 
from  lxlO17  to  1.6xl018  cm-^.  The  thickness  of  the  remaining 
oxide  is  remeasured  by  ellipsometry  after  the  samples  are  unloaded 
from  the  vacuum  system.  The  accuracy  of  the  ellipsometric  data  is 
approximately  1  X,  and  although  it  may  be  distorted  by  damage  to  the 
substrate8,  this  is  not  expected  to  be  significant  in  this  case. 

For  example,  measurements  on  substrates  in  which  the  oxide  has  been 
completely  sputtered  away  indicate  a  "residual"  film  thickness  of 
3-5  X,  attributable  to  a  monolayer  of  native  oxide  grown  after  the 
substrate  is  removed  from  the  vacuum  system. 

The  measured  change  in  oxide  thickness  is  plotted  as  a  function 
of  the  ion  dose  for  each  value  of  ion  energy.  Six  to  ten  different 
data  points  are  plotted  for  each  energy  and  the  value  of  the 
sputtering  yield  is  determined  from  the  slope  of  the  least-squares 
fit  through  the  data  points.  The  data  for  the  case  of  60  ev  Ar+ 
bombardment  is  shewn  in  Fig.  1.  The  thickness  decrease  with 


increasing  ion  dose  is  linear  and  independent  of  the  initial  oxide 
thickness. 


The  measurement  of  the  sputtering  rates  at  energies  close  to  the 
sputtering  threshold  is  difficult  to  measure  due  to  the  relatively 
low  yields  involved.  For  example,  Oostra  et  al.9  did  not  observe 
any  sputtering  of  Si02  at  argon  energies  less ■ than  50  eV  when 
examining  scanning  electron  micrographs.  It  is  believed  that  the 
method  used  here  allows  significantly  more  precise  measurement  of  the 
sputtering  rate,  and  material  removal  has  been  observed  at  energies 
as  low  as  40  eV. 

The  experimental  results  on  sputtering  Si02  by  Ar  ions  are 
summarized  by  Fig.  2.,  where  they  are  compared  to  the  experimental 

data  of  Jorgenson  and  Wehner1^  and  the  higher  energy  data  of  Oostra 

9 

et  al.  Note  that  although  widely  cited,  the  data  of  Jorgenson  and 
Wehner  is  obtained  under  quite  different  experimental  conditions  with 
the  oxide  film  covering  a  thin  tungsten  wire  immersed  in  a  DC 
discharge . 

Theoretically,  the  sputtering  of  monatomic  targets  is  well 
understood-*-!  and  good  descriptions  of  the  sputtering  yield  near 
threshold  can  be  obtained  by  semi-empirical  f ormulas-1-^"!4  which 
introduce  modifications  of  Sigmund's  theory  to  account  for  the 
existence  of  a  sputtering  threshold.  Very  good  agreement  with 
experimental  data  is  generally  observed. 

The  sputtering  process  for  multicomponent  targets  is  more 
difficult  to  describe.  The  collision  cascade  process  will  affect 
each  target  component  differently.  Apart  from  causing  the  ejection 
of  atoms  from  the  target  surface,  the  ion  bombardment  may  also  lead 

to  target  mixing,  segregation,  enhanced  diffusion,  and  other 
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processes.13  Thus,  the  correct  description  must  take  into  account 
the  changes  in  composition  of  the  target  brought  about  by  the  ion 
bombardment  and  how  these  changes  are  reflected  in  the  sputtered 
particle  flux.  This  discussion  will  concern  only  the  sputtering  of 
oxides;  the  behavior  of  compounds  or  alloys  consisting  only  of 
low-vapor  pressure  materials  may  be  quite  different.16 

In  this  letter,  the  simple  assumption  is  made  that  oxygen  is 
initially  preferentially  ejected  and  that  the  non-volatile  component 
(silicon)  limits  the  sputtering  process.  This  assumption  is 
supported  by  prior  experimental  evidence  that  the  sputtering  of 
metallic  oxides  by  low  energy  (<1  keV)  ions  leads  to  the  enrichment 
of  the  metal  component  within  several  monolayers  of  the  surface16. 
This  effect  is  expected  to  become  especially  pronounced  at  energies 
near  the  metal  sputtering  threshold,  in  agreement  with  the  general 
observations  on  sputtering  of  binary  targets  whose  components  have 
different  sputtering  threshold  energies.1^ 

Further  support  for  this  assumption  comes  from  previously 
reported  experimental  data  concerning  the  sputtering  yield  of  Ta  and 
Ta205.17"^°  Our  assumption  correctly  predicts  that  the  removal 
of  equal  numbers  of  tantalum  atoms  leads  to  a  2.48-fold  decrease  in 
the  volume  of  Ta2C>5  as  compared  to  Ta,  for  100  eV  Ar+  and 
500  eV  He+  bombardment.  This  ratio  degrades  with  higher 
bombardment  energies  since  the  ejection  of  the  material  takes  place 
from  a  layer  whose  composition  can  no  longer  be  approximated  as  pure 
tantalum. 

Applying  this  assumption  to  the  case  of  low-energy  argon 
sputtering  of  silicon  dioxide  leads  to  similarly  correct 
predictions.  The  removal  of  the  same  number  of  silicon  atoms  will 
decrease  the  volume  of  a  SiO?  sample  2.21  times  the  change  in  the 


volume  of  a  silicon  sample.  Thus,  using  the  sputtering  yield  of 
silicon  calculated  according  to  Matsunami  et  al.13  with  the 
threshold  energy  Et^  =  27  eV  as  suggested  by  Yamamura  and 
Bohdansky14  leads  to  excellent  agreement  with  the  experimentally 
obtained  data,  as  shown  in  Fig.  3.  The  inadequacy  of  the  assumption 
at  higher  energies  is  pointed  to  by  the  slight  deviation  of  the  data 
from  the  predicted  curve. 

In  summary,  the  sputtering  yield  of  Si02  under  low  energy  argon 
ion  bombardment  has  been  measured  in  the  range  40-100  eV.  An 
assumption  for  understanding  the  energy  dependence  in  terms  of  low 
energy  silicon  sputtering  with  a  simple  volumetric  rate  adjustment 
leads  to  excellent  prediction  of  the  experimental  data.  The 
assumption  has  been  extended  to  predicting  other  oxide  sputtering 
rates  with  equal  success. 
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Figure  Captions 

Fig.l.  Measured  decrease  in  the  thickness  of  the  Si02  film  as  a 
function  of  Ar  dose.  E  =  60  eV,  Ar+  flux  =  2.2xl015 

cnT^sec"! . 

Fig. 2.  Experimental  results  on  sputtering  of  silicon  dioxide  by  low 
energy  argon  ions. 

Fig. 3.  Comparison  of  calculated  and  measured  sputtering  rates  for 
Ar+-bombarded  SiC^. 
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Abstract 


The  self  -  limiting  oxidation  of  silicon  by  a  low  energy  ion  bear. 
(40-12C  eV )  is  described  by  an  implantation-sputtering  model.  The 
thin  oxide  (40-50  X)  is  grown  primarily  by  a  surface  implantation 
process  which  leads  to  a  logarithmic  increase  of  oxide  thickness  with 
dose  in  the  absence  of  sputtering.  At  higher  energies  (10C  eV),  the 
sputtering  of  the  growing  film  leads  tc  net  self -limiting  growth. 

The  model,  which  does  net  include  adjustable  parameters,  is  used  tc 
describe  the  dose  evolution  of  the  oxide  growth  as  a  function  of  bear, 
energy.  The  implantation- sputter ing  model  is  found  to  be  in 
excellent  agreement  with  experimental  observations. 


The  room  temperature  oxidation  of  silicon  by  the  use  of  a  broad, 
low-energy  ion  beam  containing  a  mixture  of  argon  and  oxygen  has  been 
shown  to  produce  an  oxide  suitable  for  use  as  a  gate  dielectric  in 
metal-oxide-semiconductor  (MOS)  transistors . * • 2  The  ion  beam 
energy  is  typically  in  the  range  of  40-120  eV  with  doses  of  the  order 
1017/cm2  delivered  in  3-5  minutes,  leading  to  oxides  of  thickness 
40-50  X.  The  oxides  are  found  to  be  stoichiometric  Si02  as 
measured  by  Auger  electron  spectroscopy  and  x-ray  photoelectron 
spectroscopy.  The  thickness  of  the  films  is  measured  after  growth 
using  several  techniques  -  the  primary  one  being  ellipsometry3 . 

The  film  thickness  has  a  weak  dependence  on  substrate  temperature  in 
the  range  25°C-650°C,  on  dose,  and  beam  energy.4  Tne  oxide 
thickness  uniformity  as  measured  by  test  sites  across  a  wafer  is 
excellent  and  far  exceeds  the  dose  uniformity.  This  letter  presents 
an  implantation-sputtering  model  to  describe  the  oxidation  process. 
The  model  predictions  are  in  excellent  agreement  with  the  above 
experimental  observations. 

In  the  implantation-sputtering  (I-S)  model,  the  oxide  growth  is 
governed  by  a  surface  implantation  process,  rather  than  by  thermal 
diffusion.  This  is  in  contrast  to  a  previous  model  suggested  for  the 
low-energy  ion  beam  oxidation  of  metals.-3  The  range  distribution 
of  the  oxygen  ions  in  the  target  is  calculated  using  linear  cascade 
theory. ^ Energy  loss  occurs  entirely  by  elastic  binary 
collisions  with  the  target  nuclei.  The  target  is  treated  as 
homogeneous,  isotropic  and  monatomic  with  target-atomi  mass  and  atomic 
number  determined  according  to  Bragg's  rule  and  taking  into  account 
the  gradual  transformation  of  the  target  from  silicon  to  silicon 


dioxide.  The  interatomic  potential  used  is  the  universal 


potential.8  Power-potential  approximations7  obtained  by  a 
least-squares  fit  to  this  potential  are  used  to  allow  analytical 
solution.  Thus,  m=l/5  is  used  to  fit  the  potential  at  reduced  energy 
in  the  range  10"8-2xl0"3.  The  mean  and  the  straggling  of  the 
distribution  are  calculated  and  it  is  fitted  by  a  simple  Gaussian. 

The  results  of  this  calculation  in  the  case  of  50  eV  O  bombardment  of 
SiC>2  are  presented  in  Fig.  1.  This  corresponds  to  the  case  of 
100  eV  O2  ion  beam  bombardment.  It  should  be  noted  that  while  the 
mean  range  is  predictably  small  (approximately  5  %.)  the  straggling  is 
significant.  It  is  this  straggling  tail  which  governs  the  oxide 
growth  evolution. 

In  the  I-S  model,  the  oxide  growth  competes  with  the  simultaneous 
sputtering  of  the  oxide  film,  by  the  same  beam,.  Since  the  experiments 
are  primarily  performed  with  5C%-50%  argon-oxygen  mixtures,  the 
sputtering  of  the  target  is  due  mainly  to  the  argon  bombardment.  The 
sputtering  rate  of  Si02  under  low  energy'  argon  ion  bombardment  has 
been  measured  in  a  separate  set  of  experiments9  and  the  results  are 
shown  in  Fig.  2.  Sputtering  at  60  eV  can  be  considered  negligible 
for  the  oxide  thicknesses  considered,  but  becomes  appreciable  by 
100  eV. 

A  computer  simulation  of  the  oxide  growth  evolution  has  been 
performed  using  the  I-S  model.  The  incorporation  of  oxygen  into  the 
’  arget  and  the  conversion  of  Si  to  SiC>2  is  accompanied  by  volume 
'-xpansion.  To  account  for  this,  the  evolution  of  the  oxide  layer  is 
alcuiated  in  small  time  (oxygen  dose)  increments.  The  range 
'  ; stribution  is  recalculated  for  each  time  interval.  It  is  assumed 


that  the  implanted  oxygen  reacts  immediately  with  unreacted  silicon, 
since  the  former  may  be  considered  "hot"  in  the  last  stages  of 
stopping.  However,  oxygen  coming  to  rest  within  stoichiometric 
Si02  is  assumed  to  outdiffuse  to  the  surface.  In  thicker  oxides, 
consideration  of  a  spatially  dependent  diffusion  coefficient  may 
become  necessary  in  view  of  the  depth  dependence  of  the  beam  energy 
deposition  and  consequent  defect  production. 

The  stoichiometry  of  the  film  as  a  function  of  depth  for 
increasing  dose  is  shown  in  Fig.  3  for  the  case  of  100  ev  O2 
bombardment.  An  interesting  feature  of  the  evolution  is  apparent. 

It  is  seen  that  the  interface  between  oxide  and  silicon  is  initially 
broad,  but  as  the  surface  erodes  due  to  sputtering,  the  interface 
becomes  more  sharp.  The  width  of  the  interface  explains  electrical 
characterization  results,  such  as  lower  channel  mobility  in  ion  beam 
oxide  MOS  transistors,  and  anomalously  larger  capacitance-voltage 
measurements  of  oxide  thickness.1® 

The  simulated  dependence  of  the  oxide  film  thickness  on  ion  dose 
for  100  eV  bombardment  is  presented  in  Fig.  4  where  it  is  compared  to 
experimental  results.  Since  the  experimental  data  is  obtained  by 
ellipsometry ,  the  simulated  thickness  is  determined  in  two  ways.  In 
the  low  dose  case  where  no  stoichiometric  oxide  has  been  formed,  the 
thickness  is  plotted  according  the  position  of  the  steepest  gradient 
in  oxygen  concentration,  which  is  approximately  halfway  through  the 
transition  region.  For  higher  doses ,  in  which  stoichiometric  S1O2 
has  been  formed,  the  edge  of  the  stoichiometric  region  is  used  to 
assess  the  thickness.  For  60  ev  bombardment  simulations  in  which 
sputtering  is  negligible,  the  film  thickness  grows  logarithmically 


with  dose,  and  is  also  in  excellent  agreement  with  experimental 
observations. 

The  simulated  final  oxide  thickness  as  a  function  of  ion  beam 
energy  is  shown  in  Fig.  5.  Also  shown  is  the  experimental  data, 
which  shows  good  agreement.  Note  that  the  experimental  and  simulated 
data  show  a  maximum  in  thickness  for  the  50-60  eV  energy  range.  The 
maximum  is  a  result  of  the  competing  processes  of  implantation  and 
sputtering.  The  calculated  thickness  for  140  eV  bombardment  is  lower 
than  the  experimental  data  due  to  an  overestimation  of  sputtering  in 
the  I-S  model.  The  low  energy  ion  beam  sputtering  model  of  oxides 
assumes  that  sputtering  of  oxygen  is  not  rate  limiting  -  an 
assumption  which  begins  to  fail  at  higher  energies.® 

In  summary,  an  implantation-sputtering  model  without  adjustable 
parameters  has  been  presented  for  the  low  energy  ion  beam  oxidation 
of  silicon.  The  model  explains  the  weak  dependence  of  oxide 
thickness  on  temperature,  energy,  and  dose.  The  excellent  agreement 
between  simulation  and  experiment  allows  the  important  conclusion 
that  linear  cascade  theory  and  the  binary  collision  approximation 
continue  to  be  applicable  at  these  low  ion  energies. 

The  authors  gratefully  acknowledge  useful  discussions  with  D.v. 
Rossi  at  Columbia,  J.M.E.  Harper  at  IBM  Research,  and  the  support  of 
the  Joint  Services  Electronics  Program  under  contract  DAAG29-85-K0049 
and  an  IBM  Faculty  Development  Award. 
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FIGURE  CAPTIONS 


1.  Normalized  range  and  deposited  energy  distributions  calculated 
for  50  eV  0+  on  Si02- 

2.  Sputtering  rate  for  low  energy  argon  ion  bombardment  of  silicon 
dioxide.  Rate  is  measured  in  angstroms  sputtered  per  dose  of 
1017  incident  ions  per  cm^ . 

3.  Simulation  of  oxide  growth  evolution.  Plot  shows  bonded  oxygen 
concentration  as  a  function  of  depth  for  10C  eV  C>2+  ion 
bombardment.  Numbers  refer  to  increasing  doses. 

4.  Oxide  thickness  as  a  function  of  dose  measured  experimentally  a 
simulated  using  the  implantation-sputtering  model  for  IOC  eV 
02+  bombardment. 

5.  Oxide  thickness  as  a  function  of  ion  energy  measured 
experimentally  and  simulated  using  the  implantation-simulation 
model.  O2  dose:  3x1Q17  cm-^. 
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USING  INCOHERENT  LIGHT  TO  GENERATE  COHERENT  EXCITATIONS 


S.  R.  Hartmann 

Physics  Department,  Columbia  University,  New  York,  N.Y. 

ABSTRACT 

Photon  echo  generation  by  incoherent  light  is  both  practical  and  efficient.  In  the 
regime  where  the  noisy  excitation  pulses  overlap  fast  transient  response  is  obtained  and  in 
addition  psec  beating  is  observed  in  Na  vapor ,  the  latter  arising  from  coherent  superposition 
states  associated  with  the  D  line  transitions.  Only  the  fast  response  requires  broadband  noisy 
light:  ultrafast  modulated  echoes  are  produced  with  either  broadband  noisy  light  spanning  the 
D  line  transitions  or  with  a  pair  of  narrow  band  lasers  resonant  with  each  of  the  D  line 
transitions. 


INTRODUCTION 

The  photon  echo  effect  is  normally  thought  of  as  a  coherent  effect  brought  about  by 
the  interaction  of  an  ensemble  of  atoms  or  molecules  with  a  sequence  of  laser  pulses  which 
must  be  coherent. '  This  notion  is  not  altogether  correct  as  was  demonstrated  in  '84  when  an 
echo  experiment  was  performed  with  the  usual  coherent  laser  sources  replaced  by  broadband 
amplified  spontaneous  emission  sources*  Not  only  was  it  possible  to  generate  echoes  in  this 
unorthodox  manner  but  the  echoes  so  produced  were  larger  than  those  obtained  using  coherent 
laser  pulses. 

Although  usually  not  explicitly  stated  it  is  understood  that  the  laser  excitation 
pulses  must  be  resonant  with  the  optical  transition  to  be  excited.  A  fuller  appreciation  of  this 
obvious  requirement  leads  to  an  understanding  of  the  unexpected  effectiveness  of  echo 
generation  with  noise.  With  this  motivation  we  ask  the  question.  What  constitutes  resonant 
radiation?  For  the  case  in  which  all  homogeneous  relaxation  times  are  long  compared  to  the 
laser  pulse  width  and  the  lasers  are  weak  so  that  power  broadening  is  negligible,  then  all 
atoms  within  a  bandwidth  Aw  -  1  /Tp,  where  Tp  is  the  duration  of  a  coherent  laser  pulse  at 
frequency  w,  interact  strongly  with  the  radiation  field.  Outside  this  regime  the  interaction 
falls  quickly  and  can  therefore  be  called  nonresonant.  A  stronger  statement  than  this  can  in 
fact  be  made  which  is  that  the  interaction  strength  within  this  bandwidth  is  uniform.  That  is, 
an  atom  at  exact  resonance  and  one  removed  by  Aw  from  exact  resonance  experience  a  pulse 
with  the  same  area  6.  But  our  interest  is  with  incoherent  radiation  pulses.  We  now  note  that  if 
such  pulses  have  a  duration  Tp  then  any  radiation  sampled  for  that  duration  and  lying  within  a 
bandwidth  no  greater  than  Aw  =  1  /Tp  from  exact  resonance  must  appear  coherent.  This  is  an 
uncertainty  argument  result.  IVfollows  that  any  particular  atom  sees  the  applied  incoherent 
laser  field  as  a  coherent  one.  It  also  follows  that  ail  atoms  within  the  common  bandwidth  Aw 
see  the  same  coherent  field  and  behave  as  a  coherent  group.  With  these  results  it  is  clear  that 
echo  production  with  incoherent  light  is  to  be  expected  Each  group  of  8ioms  within  a 
bandwidth  Aw  acts  as  a  unit.  If  fi{ is  the  coherent  dioole  momeat  density  of  one  suer  u»v‘ 
the  raoiateo  intensity  Ig-vg  of  the  atom  ensemble  varies  as  N  Pg^p  where  N  is  aQ/Aw  .  the 
number  of  units  within  tna  linewidth  aQ. 

Echo  experiments  on  the  Na  D  lines  in  Na  vapor  using  single  mode  coherent  7 
nanosecond  laser  pulses  can  only  excite  the  fraction  Aw/aQ  »  I  /30  of  the  resonance  line. 
Incoherent  echo  generation  gains  us  a  factor  of  30  by  utilizing  all  the  atoms  in  the  resonance 
line.  The  statistical  nature  of  the  incoherent  light  leads  to  a  reduction  to  33*  in  the  averoge 
value  of  /L^p*  but  the  overall  advantage  of  using  incoherent  light  in  this  situation  still 
correspond  to  a  factor  of  ( 1  /3)(  30)  or  10.  in  the  echo  experiments  we  had  performed  using 


coherent  light  the  lasers  ran  in  several  modes  and  so  we  only  achieved  an  enhancement  factor 
of  3  when  incoherent  sources  were  used.  We  constructed  the  incoherent  sources  by  taking 
ordinary  dye  lasers  and  removing  all  elements  which  might  define  a  cavity.  Over  the  region 
where  these  incoherent  sources  lased,  their  output  was  effectively  thermal. 

FOUR  WAVE  MIXING 

The  object  of  these  experiments  was  to  establish  that  echo  generation  by  incoherent 
light  was  both  possible  and  practical.  As  an  adjunct  to  this  work  a  four  wave  mixing  signal  was 
developed  in  Na  vapor  at  a  temperature  well  above  445  K  with  temporally  overlapping  pulses 
from  a  single  intense  incoherent  laser  ( thermal)  source.  The  source  output  was  split  to 
produce  two  separate  pulses  we  label  1  and  2  end  then  recombined  after  introducing  an  optical 
delay.  The  result  is  shown  in  figure  1 . '  When  the  delay  between  pulse  1  along  k  and  2  along 

k  ♦  K  exceeds  the  7 nsec  duration 
of  these  pulses  the  resulting  four 
wave  mixing  signal  along  k  ♦  2IC 
is  just  the  ordinary  photon  echo. 
The  data  we  show  here  goes  only  to 
the  7nsec  limit  The  purpose  of 
this  experiment  was  to  show  that 
no  well  defined  laser  modes  existed 
in  our  amplified  spontaneous 
emission  laser  source.  Had  they 
been  present  there  would  have 
been  a  beat  structure  on  the  four 
wave  mixing  signal.  The  rapid  fall 
o.  i.o  2.o  3.o  4.o  i.o  6.o  7.o  in  intensity  for  negative  T  occurs 

Delay  Between  Pulac  2  and  Pulce  2  (nsec)  in  the  order  of  a  1 00  psec.  For 

FIG  t  Four-wave  mixed  signal,  generated  by  use  of  positive  T  there  is  8  Signal  fall  off 

one  thermal  source,  as  a  function  of  pulse  separation.  with  B  time  constant  Of  the  order  of 

The  absence  of  any  large-scale  beating  demonstrates  that  8  few  hundred  pS8C  followed  by  8 

there  is  no  stable  mode  structure  in  the  thermal  excita-  Slower  fall  Off  due  to  the 
non  pulses  diminishing  of  the  temporal 

overlap  of  the  excitation  pulses. 

At  the  time  of  this  work  parallel  experiments  were  being  carried  out  by  Yaj  ima  et 
al^  and  Asaka  et  aP.  The  time  delayed  four  wave  mixing  experiments  of  Yajima  et  al.  used 
picosecond  noise  pulses  and  the  experiments  of  Asaka  et  al.  used  a  noisy  cw  laser,  in  both  these 
cases  the  laser  excitations  were  weak  and  a  perturbation  theory  was  suitable. 

In  an  effort  to  understand  better  the  interaction  of  noistlight  with  matter  we 
proceeded  to  study  four  wave  mixing  with  intense  radiation  fields.  We  further  broadened  the 
spectral  width  of  our  incoherent  source,  and  reduced  the  temperature  of  the  Na  vapor  to  445  K 
so  that  collisional  broadening  effects  were  negligable.  We  found  that  as  had  been  reported  by 
Yaj  ima  we  obtained  beats  in  the  four  wave  mixing  signal  when  we  varied  the  time  delay  X. 
These  beats  correspond  to  the  6  A  splitting  of  the  Na  D  lines  6  Whereas  our  incoherent  echo 
experiments  were  performed  using  light  with  a  bandwidth  of  only  a  few  GHz  our  four  wave 
mixing  experiments  with  intense  fields  twd  a  bandwidth  of  1 2  A,  sufficient  to  excite  both  D 
lines.  Our  results  are  shown  in  figure  2.^  These  results  are  similar  to  those  of  figure!  except 
for  the  dramatic  1 .9  psec  beating.  The  fast  fall  off  in  intensity  S  for  positive  x  wasn't  expected 
as  we  now  worked  in  a  collision  free  regime  where  the  result  of  Moritaand  Yaj  ima' ,  valid 
when  the  longitudinal  and  transverse  relaxation  times,  T  |  and  T2,  are  large  compared  to  the 
inverse  1  /e  half  width  of  the  inhomogeneous  line,  Suf 1 ,  applies,  as  was  the  case  in  our 
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experiment.  Our  observed  degradation  in  S  for  large  I  is  due  to  laser  noise  induced  relaxation 

effects  and  is  a  subject  of 
continuing  interest.®  Our 

j  discussion  here  will  be  limited  to 

1  -j  explaining  why  coherent  signals 

i  i.o -j  should  be  produced  by  broadband 

t  ]  .  noise  and  to  what  extent  the 

:  6.0  j  \  broadband  noise  is  essential  in 

5  effecting  a  particular  result.  We 

s  i.o  rM  '  •'  .•••> -4,  limit  ourselves  to  the  weak 

:  j  _  _  excitation  limit  and  we  refer  the 

|  :.oj'  reader  to  the  work  of  Mori ta  and 

]  j  Yaj  ima  for  an  analytic  analysis  of 

o.o  j - - -  many  aspects  of  this  problem. 
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Pulse  Separation  (ptec)  SIMPLE  ANALYSIS 

First  we  present  a 

I  simple  picture  of  the  time  delayed 

1  ,.0i  j,i  ii  i,|  slim  ,  four  wave  mixing  process.  We 

-  ii  i'l'i  fiy  if  lip  ‘Mill  begin  by  modeling  the  incoherent 

:  6.o,H1l;jiiJ  1  Iff  II  |  III  j!  If  laser  excitation  as  a  series  of  delta 

t  iijilr1/  ml  f'jlf  U  -I  Ml '  :  function  spikes  randomly  phesed. 

S  Hr'  The  laser  output  is  divided  and 

~  !  'll  '  recombined  so  that  the  sample  is 

l  2  u  j  irradiated  with  two  identical  laser 

*  j  pulses  directed  Blong  k  and  It  ♦  1C 

s  r  [ _ _ _ _  with  relative  delay  X.  These  two 

"  •-  «o  «o  ;oo  spike  trains  are  pictured  in  the 

Pul**  Separation  (p»«c)  upper  part  of  figure  3.  The  lower 

part  of  this  figure  shows  two 

FIG.  a  Experimentally  measured  signal  intensity  vs  exci-  recoil  diagrams  .  In  the  Upper 

tation  pulse  separation  in  a  10-cm-long  Na  cell  at  -445  K  <b'  reCOll  diagram  the  Spike  along  k 

The  same  dau  for  pulse  separations  of  50—100  psec  with  error  preceeds  the  spike  along  k  ♦  K 

bars  and  lines  connecting  the  dau  The  l.l-psec  modulation  is  While  in  the  lower  fUCOi  1  diagram 

due  to  the  Na  3  P  fine-structure  beating.  ft  follows  it.  In  all  parts  Of  the 

figure  the  time  variable  is 
synchronized  and  increases  to  the 
right.  Let's  consider  the  upper  recoil  diagram.  It  shows  the  effect  of  the  first  four  noise  spikes 
along  k  seoarately  generating  excited  state  amplitudes  which  recoil  until  deexcited  by  the 
corresponding  noise  spues  along  k  ♦  It.  in  this  manner  a  macroscopic  ground  state  amplitude 
with  momentum  corresponding  to  -K  is  built  up.  The  essential  thing  here  is  that  although  the 
noise  spikes  in  each  directed  pulse  train  are  randomly  phased  there  is  a  fixed  phase  difference 
between  noise  spikes  separated  by  x  in  the  two  pulse  trains.  When  we  consider  the  effect  of  a 
noise  spike  along  k  ♦  K  acting  on  the  ground  state  amplitude  the  effect  is  as  shown  in  the 
central  section  of  the  upper  recoil  diagram.  An  excited  state  amplitude  is  generated  which 
recoils  along  k  ♦  K  and  collides  x  later  with  the  ground  state  amplitude  recoiling  along  -1C. 

The  result  is  the  formation  of  a  macroscopic  dipole  moment  with  wave  vector  k  *  K  -( -K)  * 


FIG.  3.  '»■  Experimentally  measured  signal  intensity  vs  exci¬ 
tation  pulse  separation  in  a  10-cm-long  Na  cell  at  445  K  <b' 
The  same  dau  for  pulse  separations  of  50—100  psec  with  error 
bars  and  lines  connecting  the  dau  The  l.b-psec  modulation  is 
due  to  the  Na  3  P  ftne-structure  beattnc. 


'  w.  - 


>  •>  >  v  V  V  V  V  V  V.V  V,  ‘ 


555 mzm 


->T>-  tcJ  • 


y  rk<2K  /--k 


/  /  /  /  / 

/  /  /  / _ i 

*  i  *  *  f 

i— --  k  i-k-K 


/—  k*K 


JJJJL 

•  *  *  * 


FIG.  3.  Upper  section:  Two  Identical  leser  spike  trains 
wit/i  relative  delay  T  along  k+k  end  k.  Lower  section: 
Recoil  diagrams  showing  signal  generation  along  k+2K  for 
positive  T  (above)  end  for  negative  T  (below).  Time 
increases  to  the  right  in  all  cases. 
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FIG.  4.  Recoil  diagram  showing  origin  of  ground 
superposition  state  modulation. 


k  ♦  2t  In  the  phase  matched 
direction  in  which  we  look.  Each 
noise  spike  along  k  ♦  l  therefore 
generates  a  macroscopic  dipole 
moment  which  radiates.  Since 
these  noise  spikes  are  randomly 
phased  the  signal  we  see  is  the  sum 
of  the  intensities  which  would  be 
radiated  separately  by  each  of  the 
induced  dipole  moment  densities. 
The  picture  we  have  presented 
corresponds  to  a  series  of 
accumulated  photon  echoes  which 
combine  incoherently  to  yield  the 
result  expressed  above  in  Eq(  1 ). 
The  signal  falls  to  half  its 
assymptotic  limit  at  X  -  0  since 
the  echo  amplitude  is  fully  formed 
on  application  of  the  noise  spike 
along  k  ♦  K  end  the  contribution 
to  the  time  integral  of  the  echo 
signal  during  its  formation  is  not 
obtained.  For  negative  T  a  similar 
analysis  using  the  lower  recoil 
diagram  obtains  and  the  resulting 
signal  is  seen  to  be  best 
interpreted  as  a  series  of 
incompleted  free  decays. 

The  1  9  psec  beets  we 
observed  can  be  expleined  in  terms 
of  the  macroscopic  ground  state 
population  recoiling  along  -K 
being  modulated  at  the  difference 
frequency  of  the  sodium  D  lines. 
Consider  figure  4  which  shows  the 
recoil  diagram  appropriate  when 
two  separate  transitions  are 
excited  by  the  same  delta  function 
spike.  Each  excited  state  amplitude 
evolves  according  to  its  energy 
eigenvalue  with  the  result  that  the 
ground  state  formed  T  later  has  an 
amplitude  proportional  to 
[exp(-iOl)  +  exp(-iO'T)]  ~ 
cos[(  Q-Q')T/2)  where  0  and  O' 
are  the  eigenfrequencies  associated 
with  the  two  excited  states.  As 
before,  the  effect  of  a  leter  spike 
along  k  +  K  is  to  generate  a 
recoiling  state  which  collides  with 
the  reformed  ground  state  to 
produce  an  echo.  This  ground  state 
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having  an  amplitude  which  is  modulated  at  half  the  difference  frequency  of  the  two  excited 
states  gives  rise  to  an  echo  intensity  modulated  by  cos[(  O-O' )t].  As  can  be  seen  from  the 
figure  this  is  true  whether  the  interrogation  spike  which  produces  the  recoiling  excited  state 
(states)  is  resonant  with  either  of  the  two  transitions  or  both. 

The  virtue  of  the  recoil  diagram  approach  is  that  it  provides  a  graphic  picture  of  the 
echo  formation  or  time  delayed  four  wave  mixing  process.  If  relaxation  processes  are 
introduced  then  the  radiated  signal  will  diminish  with  increasing  1  because  the  superposition 
states  will  dephase  reducing  the  echo  formation  directly  as  well  as  indirectly  through  an 
incomplete  formation  of  the  macroscopic  ground  state.  It  appears  th8t  this  technique  may  be 
well  suited  for  the  measurement  of  fast  relaxation  phenomena.  0 

ULTRAFAST  SPECTROSCOPY  WITH  NARROW  BAND  LASERS 
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The  incoherent  echo  analysis  was  motivated  by  the  realization  that  only  radiation 
resonant  with  a  transition  will  interact  with  it  This  observation  suggests  an  alternative 
method  for  generating  the  psec  beats  observed  above.  The  method  is  to  replace  the  broadband 
incoherent  laser  whose  spectral  output  covers  both  lines  with  two  narrowband  coherent 
lasers  each  resonant  with  one  of  the  transitions. 1 1  The  output  of  these  lasers  is  combined  and 
treated  as  a  single  pulse.  We  performed  a  regular  two  pulse  photon  experiment  using  regular 
dye  lasers  with  7  nsec  duration  pulses.  As  expected  we  found  that  echoes  are  produced  which 

are  in  all  respects  similar  to 
echoes  previously  produced  except 
/  /  that  when  we  vary  the  pulse 

*  ✓  x-ct  Xx*c(t-D  separation  T  the  overall  echo 

/  /  intensity  varies  according  to 

/  t  cosKO-Q'ft].  Thus  the  psec 

/  ^ 1  1  beeting  is  obtained  with  narrow 

-  /  \  band  excitation.  This  beating  is 

/  observed  even  when  a 

/  :  /  spectrometer  is  introduced  in 

*  ~  ■  i  front  of  the  detector  so  that 

T  *2  radiation  is  detected  from  only  one 

FIG  S’  Space-lime  diagram  showing  excitation  pulses  and  Of  the  two  transitions  excited  This 
an  atom  traveling  at  u..  demonstrates  that  the  beating 

arises  from  an  interference  in  the 
common  ground  state  population. 
This  result  holds  independent  of  the  frequency  splitting  of  the  two  optical  transitions  excited. 
By  proper  selection  of  materials  narrow  band  excitation  of  beats  in  the  femtosecond  and 
attosecond  regimes  is  possible.  These  beets  can  be  observed  with  slow  detectors  as  the  beat  is 
present  in  the  integrated  echo  intensity  signal  not  in  the  individual  echo  sigmel  envelope. 

The  echo  intensity  is  calculated  by  following  a  moving  atom  which  is  irradiated  at  1 1 
end  t2.  The  lasers  are  not  at  exact  resonance  but  must  both  excite  the  same  velocity  subgroup 
as  we  are  interested  only  in  the  situation  where  beets  occur.  We  use  a  ♦  and  -  subscript  to 
indicate  time  just  before  and  after  the  pulse  arrival  end  we  list  the  wevefunction  development 
as  follows. 

t>t  1  *  >=|g>*exp(  -iO(  t-t  j )]  |e>*exp(  -  iO'(  t-t  j )]  |e’> , 
t2_Hg>+exp[-iO(t2-t1)]|e>+exp(-Kl'(t2-t|)]|e,>I 


FIG  S’  Space-lime  diagram  showing  excitation  pulses  and 
an  atom  traveling  at  u.. 
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t2*>*|e>+|e'>*(exp[-IO(t2-tj)]|9>*axpI-iO'(t2-t|)]|Q’>}, 

t>t24>"exp(-i0(t-t2)l|e>'»expt-10’(t-t2)]|e‘> 

♦  {expl-jQ(t2-tj)]-»exp[-iO'(t2-tj)]}|g>. 

From  the  wavefunction  at  we  calculate  the  dipole  moment  of  the  atom  being  followed  and 
label  it  with  the  subscript  i: 

Pi-{expiO[(t-t2)-(t2-t^  )]}•[  1  ♦exp-K 0  -0)( 1 2~t | )]  ♦  c.c.. 

The  dipole  density  is  then  obtained  by  summing  over  all  atoms,  yielding 

P(x,t)«2  jPj(t)S(x-Xj-Vjt)  where  Jdvg(v).  The  lineshape  is  given  by 

g(v)*(  1//2tt)(  l/v_)exp-(v-vs)2/vp2  where  vs=(aQ/0)c  is  the  average  velocity  of  the 
interacting  atoms  as  determined  by  the  detuning  of  the  laser  from  the  resonance  frequency  of  an 
atom  at  rest,  the  quantity  VpKTQ/T-Jvp ’s  the  sPre«l  in  velocities  covered  by  the  laser  pulse 
whose  duration  is  Tp.  The  radiated  field  from  those  atoms  moving  at  v^  is  obtained  from 

E~JdxP(x,t*x/c)  -'SjPjlt-'tXj  +  VjO/Cc-Vj)]. 

The  relationship  between  the  various  parameters  is  displayed  in  figure  5  and  used  to  evaluate  P  j 
to  order  Vj/c.  This  done  we  can  obtain  E  and  the  corresponding  integrated  intensity  which  we 
write  as 

1 1,1  ’echo"1  ♦  exp[(- l/4)[  ivp(  0-0‘  )/c]2cos((  D-D‘)l(  1  «•  v$/c)] 

Since  the  modulation  is  observed  only  when  both  lasers  are  simultaneously  resonant  on  the  same 
atom  the  Doppler  shift  occurs  only  in  the  difference  frequency.  In  the  limit  that  the  lasers  are 
multimode  and  the  mode  density  is  dense  then  the  response  becomes  Doppler  free  and  is  given  by 

Jdtlgcho-  1  ♦  exp{-(lM)[i(n--0)/c]2(vD2^v  2/2)}cos<n-0-)T} 


where  vD  is  the  Doppler  velxity.  The  term  involving  vn  degrades  the  echo  in  Na  with  a  time 
constant  of  400  nsec  when  the  Na  temperature  is  410  Ki  This  is  the  dominant  relaxation  term 
after  lifetime  relaxation  and  only  becomes  important  when  the  time  between  the  first  excitation 
pulse  and  the  echo  exceeds  50  fluorescence  lifetimes. 


CONCLUSION 

It  has  been  demonstrated  that  incoherent  or  broadband  light  is  suitable  for 
generating  photon  echoes  and  for  performing  time  delayed  four  wave  mixing  experiments.  The 
broadband  character  of  the  light  makes  it  possible  to  excite  a  wide  spectrum  of  resonances. 
This  is  useful  when  relaxation  broedening  is  important  The  simplicity  of  the  technioue  makes 
it  a  promising  tool  for  the  study  of  fast  relaxation  processes  and  wide  band  spectroscopy. 
Insight  into  the  generation  of  coherent  beats  with  broetoand  light  was  obtained  by  performing 
a  parallel  experiment  with  e  pair  of  narrowband  lasers.  Doppler  free  spectroscopy  should  be 
possible  with  this  technique. 
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Comment  on  "Diffraction-Free  Beams' 


The  recent  work  of  Dumin,  Miceli,  and  Eberlyl  on  Diffraction-Free  Beams  brings  to  mind 
the  experiment  which  generates  the  Poisson  Spot. 2  As  normally  observed  by  diffracting  parallel 
light  around  a  circular  obstacle  the  Poisson  Spot  retains  its  intensity  on  axis  as  one  recedes  from 
the  obstacle  while  its  radius  increases  linearly.  However  if  the  obstacle  is  placed  in  the  focal  plane 
of  a  following  lens  as  was  the  case  in  ref.  1  the  illuminated  spot  which  is  observed  in  the  center  of 
the  shadow  exhibits  the  general  characteristics  which  are  described  in  Dumin  et  al.  For  our 
experimental  conditions,  as  described  below,  the  spot  retains  its  intensity  and  sharpness  as  the 
observation  point  is  moved  over  the  latter  half  of  the  156  cm.  range  on  which  it  forms.  However, 
we  probably  ought  not  to  say  that  the  spot  constitutes  a  diffractionless  propagating  beam.  It  is 
rather  a  line  image,  as  is  shown  by  the  fact  that  a  new  obstacle  placed  in  its  path  does  not  obliterate 
the  spot  further  along  the  axis.  In  figures  la  and  lb  we  show  the  shadow  of  a  .8  cm  ball  bearing 
illuminated  by  a  1.5  cm  diameter  collimated  He-Ne  laser  light  beam  as  observed  on  a  white  card 
150  cm  from  a  25  cm  lens  which  is  analogous  to  the  lens  in  the  apparatus  of  Ref.  1.  Figure  la 
shows  the  normal  unblocked  situation  while  figure  lb  was  obtained  when  a  .3  cm  diameter  rod 
was  inserted  accross  the  beam  at  a  position  40  cm  before  the  white  card.  The  shadow  of  the  rod  is 
clearly  visible  as  is  the  undisturbed  Poisson  Spot.  This  shows  that  the  energy  in  the  spot  did  not 
get  there  by  traveling  along  the  axis.  The  rod  only  blocks  that  pan  of  the  image  lying  close  enough 
to  the  rod  so  that  it  cannot  "see"  the  edge  of  the  ball  bearing.  Funher  away,  light  skimming  the 
edge  of  the  ball  bearing  has  a  free  path  to  the  image  position  and  hence  it  is  not  affected. 
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A  new  formalism  of  the  Kronig-Penney  model  has  been  developed  which  is  considerably  simpler 
than  the  conventional  one.  It  gives  not  only  the  carrier  energy  bands  but  also  the  wave  functions  at 
the  edges  of  each  band.  Although  it  is  formulated  to  be  applied  to  superlattices,  it  is  also  applicable 
to  bulk  materials. 


I.  INTRODUCTION 

The  Kronig-Penney  model1  is  an  idealized  method  of 
calculating  the  energy  bands  in  crystalline  solids  with 
periodically  varying  potentials.  It  has  been  applied  pri¬ 
marily  to  bulk  semiconductors2'3  to  study  the  qualitative 
nature  of  band  structures.  Recently,  it  has  also  been  ap¬ 
plied  to  semiconductor  superlattices4-7  to  calculate  quan¬ 
tum  confined  energy  subbands.  Here,  because  of 
conduction-  and  valence-band  discontinuities, 
smaller-energy-gap  layers  act  as  potential  wells  between 
larger-energy-gap  layers.  Though  the  effective  mass  of 
carriers  is  the  same  throughout  bulk  materials,  it  is 
different  in  the  well  and  barrier  layers  of  superlattice 
structures.  This  effective  mass  difference  must  be  taken 
into  account  by  the  appropriate  boundary  condition  at  the 
well/barrier  interface.  The  conventional  formalism  of  the 
Kronig-Penney  model  for  bulk  materials  must  therefore 
be  modified  for  superlattices. 

Mukherji  and  Nag4  considered  the  effective-mass 
difference  in  formulating  the  Kronig-Penney  model,  but 
assumed  that  the  wave  function  and  its  first  derivative 
were  continuous  at  the  interface.  Bastard3  replaced  the 
assumption  of  continuity  of  the  first  derivative  of  wave 
function  with  an  envelope  function  approximation.  He 
showed  that  the  first  derivative  of  the  wave  function  divid¬ 
ed  by  effective  mass,  or  the  probability  current,  is  continu¬ 
ous  at  the  interface  under  certain  conditions.  Although 
several  other  boundary  conditions  have  been  pro¬ 
posed,8-10  Bastard’s  is  the  simplest  and  most  intuitively 
appealing.  The  results  of  particle  energy  calculations 
based  on  Bastard’s  boundary  condition  are  consistent  with 
the  experimental  data."  Recently,  the  Kronig-Penney 
model  with  Bastard’s  boundary  condition  was  also  used 
to  calculate  the  band  offset  of  GaAs/A!jGa|_,  As  super- 
lattices.12 

The  purpose  of  this  paper  is  to  present  a  new  formalism 
of  the  Kronig-Penney  model  developed  using  Bastard's 
boundary  condition.  This  formalism  is  based  on  the  ob¬ 
servation  that  wave  functions  corresponding  to  maximum 

Ae"r:  -1"2  -Be-‘a'!-a/2'  =  'Pa{z),  0 
*  *  "  CV  2  +De  *6/2!  =  'IVz),  -blzi  0, 


and  minimum  energies  of  each  band  have  definite  parities. 
Since  this  formalism  is  simpler  and  easier  to  manipulate 
than  the  conventional  formalism,  it  can  be  used  to  find 
energy  bands  and  wave  functions  in  superlattices  and  bulk 
materials  by  simple  numerical  analysis.  In  the  following 
section,  both  the  conventional  formalism,  and  the  new 
formalism  of  the  Kronig-Penney  model  with  Bastard’s 
boundary  condition,  will  be  developed  and  compared.  In 
Sec.  Ill  application  of  the  new  formalism  will  be  illustrat¬ 
ed  by  finding  the  energy  subbands  and  envelope  wave 
functions  of  a  GaAs/AlxGa,  As  superlattice.  This  ap¬ 
plication  will  then  be  discussed. 

II.  FORMULATION 

The  periodic  potential  in  a  superlattice  is  not  a  real 
atomic  potential,  but  a  periodic  repetition  of  different  en¬ 
ergy  gap  layers,  the  thickness  of  which  is  usually  many 
times  larger  than  the  period  of  the  atomic  potential.  As  a 
consequence,  the  actual  wave  function  of  a  superlattice  is 
a  bulk  atomic  wave  function  modulated  by  the  envelope 
wave  function  of  the  superlattice  potential.  The  energy- 
bands  of  a  superlattice  are  composed  of  a  discrete  series  of 
subbands  induced  by  the  superlattice  potential  inside  a 
bulk  band. 

In  this  section,  we  will  formulate  the  energy  subbands 
and  envelope  wave  functions  induced  by  the  superlattice 
potential,  using  a  new  formalism  of  the  Kronig-Penney 
model  and  applying  Bastard’s  boundary  condition.  Be¬ 
fore  developing  the  new  formalism,  the  conventional  for¬ 
malism  is  first  developed  for  comparison. 

A.  Conventional  formalism 

Figure  1  shows  the  periodic  square  potential  of  a  super- 
Lattice  with  well  thickness  a,  barrier  thickness  b,  and  har¬ 
rier  height  V.  Panicle  effective  mass  is  m^  at  the  well  and 
mb  at  the  barrier.  From  the  solution  of  the  Schrddinger 
equation,  the  envelope  wave  function  m  two  adjacent 
periods.  —  b  <z  <a  and  J  <  Z  <  d  -a.  can  be  w  ntten  as 
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where  d  =a  +b  is  the  period  of  the  superlattice,  A,  B,  C,  and  D  are  complex  numbers,  e‘*  is  the  phase  factor,  and  z  is 
the  axis  perpendicular  to  the  superlattice  layers,  a  and  (3  are  defined  as 


a  =  -j(2maE)]/2 

ft 


ps^[2mb(E  -V)]u2  , 

ft 


where  E  is  the  particle  energy.  'Pg,  'f'c,  and  'Vj  are  defined  only  inside  a  well  or  a  barrier.  We  note 
|  ¥,(())  |  =  |  *>«(<*)  I  and  \*b(-b)\  =  |  ^*(0)1  from  the  symmetry  of  the  wells  and  barriers,  and  also  note 
j  vp(z  +d )  I  =  |  'J'(z)  |  from  the  periodicity  of  superlattice  potential.  Using  Bastard’s  boundary  condition  at  z  =0  and 
z  =j,  i.e., 


'Pa(0)  =  'iy0),  — 

ma  mi, 


m„  mj 


where  <1 >'a(z)  and  q^lz)  are  the  first  derivatives  of  ^(z)  and  'l/6 (r),  respectively,  and  applying  the  condition  that  A ,  B,  C. 
and  D  should  not  be  zero  simultaneously,  we  obtain 


1 

ma 


_J_  1 

ma 

.<l<t>*i>0l  j: 


^  ma  _  tX  „  ,aa  fd  n<t>—b0 1  _  @ 


Note  that  by  Bloch’s  theorem  'P(z)  =  e,fc!u  (z I,  where  k  is  the  wave  number  of  the  envelope  function  in  the  z  axis  and 
u(z)  is  a  periodic  potential  satisfying  utz)  =  u(z  +  d).  It  is  obvious  that  tlz  -*-d)  =  e‘kde'l^u\2  +  d)  is  e,w'4'(z).  Thus  <t> 
is  equal  to  kd.  By  simplifying  the  above  determinant  and  replacing  <J>  by  kd,  we  obtain 


cos(kd)—cos  ~(2 m0£)l/2  cosh  ^[2 mb(V  —E)]wl 

ft  ft 
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-sin  —12 maE)[/2  sinh  ~z[2mhi  V  —  £i]‘  ~  :  for  V>E 
ft  ft 
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where  fi=h  /2n  and  h  is  the  Planck's  constant. 

Similar  equations  have  been  derived  by  Bastard/  who 
also  included  the  effect  of  momentum  in  the  direction 
parallel  to  the  layers.  If  continuity  of  d'V'z'/dz  is  adopt¬ 
ed  instead  of  ( 1  /m  id'pi  zi/dz,  t/Vj/m*1'  *  is  replaced  by 
imii/Wj  i'  •  and  vice  versa,  and  the  overall  energy  bands 
shift  upward."  From  Eqs.  < 3)  and  >4)  the  E-k  dispersion 
relations  can  be  readilv  obtained. 


B.  New  formalism 


Even  though  lhe  conventional  formalism  yield  the  en¬ 
ergy  bands  of  superlaltices  or  bulk  materials,  n  is  quite 
complicated  and  cannot  yield  the  envelope  wave  fun^ 
lions  A  r  „vs  formalism,  which  will  be  developed  here,  is 
simpler  than  the  conventional  formalism  and  yields  the 
envelope  wave  functions  corresponding  to  the  edge  ener- 
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FIG.  1  A  schematic  view  of  the  periodic  square  potential  of  a 
superlattice,  a  is  the  well  thickness,  b  is  the  barrier  thickness, 
d  =a  ->-b  is  the  period  of  the  superlattice,  and  V  is  the  bamer 
height,  and  tFf  are  the  envelope  wave  functions  in  the  wells 
at  0<r  <a  and  d  <z  <d  +  a,  respectively.  '*’*  and  ty*  are  the 
envelope  wave  functions  in  the  barriers  at  —  6  <z  <  0  and 
a  <  z  <d,  respectively. 


gies  of  each  band.  These  envelope  wave  functions  are  use¬ 
ful  to  analyze  the  superlattice  properties. 

From  Eqs.  (3)  and  (4),  the  phase  difference  < <P  =  kd )  be¬ 
tween  'F(z)  and  Wz  -*-d)  is  0  or  ~~  for  the  wave  function 
corresponding  to  minimum  and  maximum  energy  of  each 
band.  The  phase  difference  between  Vt/<J(0)  and  'Pc(d)  is 
the  sum  of  the  phase  difference  4><,  between  (0)  and 
't’a(a),  and  the  phase  difference  <t>j  between  and 

A>d(d).  When  A  is  different  from  B  or  —  B  in  Eq.  ( 1  >,  <t>_,  is 
different  from  0  or  xtr.  Similarly,  when  C  is  different  from 
D  or  —  D  in  Eq.  (1),  <t>b  is  also  different  from  0  or  ±ir.  In 
general,  when  and  <Pb  do  not  have  even  or  odd  parity 
(when  A=r=B  or  — B ,  or  CVZ)  or  —  D),  the  total  phase 
difference  cannot  be  0  or  zr.  Therefore,  the  envelope 
wave  functions  corresponding  to  the  minimum  and  max¬ 
imum  energy  of  each  band  must  have  even  or  odd  parity. 

I.  Odd-index  bands  (n  =  1,3,5,  . . .) 

As  will  be  shown  in  Fig.  2,  the  minimum  energy  of 
every  odd-index  band  corresponds  to  cos<t>=!  or  <t>=0. 
Following  the  above  arguments,  <t>a  and  <t>*  should  both 
be  even  or  both  be  odd  at  minimum  energies.  Note,  how- 


FIG.  2.  The  first  five  electron  energy  subbands  of  a 
GaAs/Al,Gai_,As  superlattice  with  x  =0.5,  a  =  100  A,  and 
b  =25  A.  V  =  375  meV  is  the  barrier  potential  height.  Dashed 
curves  (a)  and  (bi  correspond  to  the  functions  in  the  left-hand 
side  of  Eqs.  (7)  and  (8),  and  (111  and  (12),  respectively.  Dotted 
curves  i cl  and  1  d 1  correspond  to  the  functions  in  the  left-hand 
side  of  Eqs.  (15)  and  (16),  and  <  19)  and  (20),  respectively.  Solid 
curve  (e)  corresponds  to  the  right-hand  side  of  Eqs.  i 3 1  and  (4). 
respectively,  n  is  the  band  index. 


ever,  that  at  large  b  each  band  in  a  well  degenerates  to  an 
energy  level  with  vanishing  bandwidth.  Thus  4'™°  (corre¬ 
sponding  to  minimum  energy)  and  (corresponding  to 
maximum  energy)  become  identical.  Hence,  the  panties 
of  'F™n  and  'FJ1**  must  be  the  same  for  any  value  of  b. 
Furthermore,  4*^  is  even  for  the  first  (ground1  state,  odd 
for  the  second  state,  and  alternating  even  and  odd  with 
increasing  n.  Therefore,  both  'F™n  and  'F™ln  of  every 
odd-index  band  have  even  panty,  and  can  be  wntten  as 


where  6  =  i/3  =  [2mfc'  F  — £■]'  :/ft. 

Using  Bastard's  boundary  condition  at  z  =0.  t.e., 

>FJi0i  =  'Fh(0) 

and 
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we  obtain 
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.  oa  n  »  bb  A  aa  B  c  .  ,  bb  c  r,  „ 

^  cos  — -  =5  cosh  —  ,  - asm  — -  = - ostnh  —  for  V  >  E  , 

2  2  ma  2  mb  2 

A  u  aa  D  bb  A  aa  —B  a  .  bp  r  t/  c 

A  cosh  —  =B  cos  —  ,  - asm  — -  = - Psin  —  for  V  <E  , 

2  2  ma  2  mb  2 

which  in  turn  gives  the  minimum  energy  of  every  odd-index  band: 

m  v  11/2  , 

tan  ^(2 m,£m,„)l/!  -  —  tanh  -£-[2 mb(V -Emm)]'n  =0  forK>£,  (7) 

Lmin 

rm  r  K  ]i/2 

tan  £(2ma£min)l/2  +  —  1-  — -  tan  ~[2rnb{Emm-V))in  =0  forF<£.  (8. 

Zfi  mb  h,  m,n  Zfi 

The  smallest  solution  of  Eq.  (7)  corresponds  to  n  =  1  (ground  subband),  the  next  solution  to  n  =3,  etc.  The  band  in¬ 
dex  corresponding  to  the  smallest  solution  of  Eq.  (8)  depends  on  how  many  bands  are  confined  inside  the  well. 

To  find  the  maximum  energy  of  each  odd-index  band,  we  note  that  cos<t>  is  —  1  or  «t>  is  ±tr.  <Pa  is  0  because  'I',,  is 
even.  Since  <J>a  +  <t>(,  is  ±ir,  <J>b  is  xtt,  or  <J>t  is  odd.  Hence,  the  wave  functions  can  be  written  as 


'l'J,“(z)=  A  cos  a  z—  y  ,  'P'b*x=B  sinh  6  z+y  forF>£, 

V™'(z)=Aco s  a  z-y  ,  sin  j/?  z  +  y  for  V  <£  . 

Following  the  same  procedure  as  above,  we  obtain  the  maximum  energy  of  every  odd-index  band: 

[  \m  \  V  I  1/2  h 

tan  £<2 maEm3X)'/2  -  —  - - -1  I  coth  ^-[2mt(  V -£m„  )]l/2  =0  for  V  >  £  , 


ma 

V 

E  -1 
max 

mb 

,  v 

mb 

Em 

2.  Even-index  bands  (n  —  2,4,6, . . .) 

For  even-index  bands,  the  minimum  energy  corresponds  to  cos<J>=  — 1  or  <J>=:!:j7.  Since  is  odd  and  <J>j  is  r:, 
4>(,  =  <l>  —  <t>a  is  0.  Thus  ♦(,  is  even,  and  we  can  wnte 

'F2’"’(z)=  A  sin  [a  z  —  1  ,  'FJ"n  =  £cosh  6  z  4-  —  forF>£,  1 1 3 1 


'V?m(z)=A  sin  la  z-~  j,  'Pfm  =  Bcos\B  z+~  for  F<£. 

[  I  2  I  I  2 

Using  the  boundary  condition  and  simplifying  as  above,  we  obtain  the  minimum  energy  of  every  even-index  band: 

I  i  r  ...  i  ..  i  1 1/2  i  .  i 
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The  smallest  solution  of  Eq.  <  1 5 )  corresponds  to  n  =2,  the  next  to  n  =4,  etc.  The  band  index  corresponding  to  the 
smallest  solution  of  Eq.  (16)  depends  on  how  many  bands  are  confined  inside  the  well. 

To  find  the  maximum  energy  of  every  even-index  band  we  note  that  cos<t>  is  1  or  <t>  is  0.  Since  Pj  is  odd  and  <J> d  is 
—  ir,  <t>b  =  <t>  —  <t>a  is  ±tt.  Thus  ’F(,  is  odd.  Hence,  we  can  wnte  the  wave  functions  as 


sin  la  z-f  j.  'I'™*  =  fi  sinh  !  6  z  —  for  F  >  £ 

I  i  2  1 1  !  ■  2  1 1 

♦~*‘z'=/<sin!a  !z-f  >Vr'=flsin  '0  'z --  '  for  F  <  £  . 
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Following  the  same  procedure  as  above,  we  obtain  the  maximum  energy  of  every  even-index  band: 

m  v  1/2  fc 

cot  £<2 ma£m„)l/2  +  —  - - -1  coth  ^[2mb(V-Emu)]'n  =0  forF>£, 


m  b 


cot  —(2  maEmtxy/2  +  -  1-  — 

2«  mb  E„ 


cot  -^[2mblEmtx-V)]'n  =0  for  V  <E 
2  fi 


Even  though  formulas  similar  to  Eqs.  (7)  and  (19)  have 
been  previously  derived  by  Masselink  et  a/.,11  their  rela¬ 
tionship  to  the  Kronig-Penney  model  has  not  been 
clarified.  In  addition,  they  are  not  a  complete  formalism 
because  they  cannot  yield  the  maximum  energies  of  odd- 
index  bands  and  minimum  energies  of  even-index  bands, 
and  the  corresronding  envelope  wave  functions. 

The  question  might  arise  why  only  one  boundary  con¬ 
dition  at  z  =0  was  used  in  developing  the  new  formalism, 
whereas  boundary  conditions  at  both  z  =0  and  z  =a  were 
used  to  derive  Eqs.  (3)  and  (4).  The  explanation  lies  in 
the  main  difference  between  the  two  formalisms.  Since 
the  conventional  formalism  is  concerned  with  all  the  ener¬ 
gies  from  0  to  oc,  Eqs.  (3)  and  (4)  indicate  that  <t>  (  =  kd) 
can  be  any  real  value  from  0  to  2 ir  (for  energies  inside 
bands)  or  any  complex  value  (for  energies  outside  bands). 
Thus  there  are  no  definite  parities  of  'Pa  and  'P*  and  no 
symmetry  relationship  between  'Pa(O)  and  'P 0(a)  or 
'Pit  —  b)  and  'PjtO).  Therefore,  in  Fig.  1,  the  boundary 
condition  at  z  =0  and  z  =a  are  different  and  both  bound¬ 
ary  conditions  must  be  used.  On  the  other  hand,  the  new 
formalism  is  only  concerned  with  the  band-edge  energies, 
at  which  <P  can  be  only  0  or  ±ir.  Thus  and  'Pj,  have 
definite  parities.  Since  'Po(0)  =  ±'P0(a)  and 
'P(,(0)  =  ±'Pj(a),  the  boundary  condition  at  z  —  0,  which 
determines  the  relationship  between  'Pa  and  'Pi,,  is  identi¬ 
cal  to  that  at  z  =  a.  This  is  the  reason  we  need  only  one 
boundary  condition  in  developing  the  new  formalism. 

When  b  becomes  sufficiently  large  for  V  >  £,  Eqs.  (7) 
and  (11)  degenerate  to 


tan  —  (2m„£)l/:  — 
2fj 


and  Eqs.  (15)  and  ( 1 9 1  to 


for  n  =  1.3,5, . 
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E 

for  n  =  2,4,6,  .... 

These  results  are.  of  course,  identical  to  the  single-well  re¬ 
sults.13 


III.  APPLICATION  AND  DISCUSSION 

As  an  illustration  of  the  application  of  the  new  formal¬ 
ism,  we  have  calculated  the  electron  energy  subbands  of  a 
GaAs/AI,Gai  _ ,  As  superlattice  with  x  =0.5,  a  =  100  A, 
and  b  =25  A  by  using  both  the  conventional  formalism 
[Eqs.  (3)  and  i4l]  and  the  new  formalism  [Eqs.  (5i  to  (20)]. 


We  have  also  calculated  the  envelope  wave  functions  us¬ 
ing  the  new  formalism. 

Figure  2  shows  the  first  five  electron  energy  subbands. 
We  have  taken  A£s  =0.476+0. 125x +0. 143x2  eV  for 
0.45  <x  <  l,14  as  the  band-gap  difference  between  GaAs 
and  AljtGai_xAs  at  room  temperature;  mo=0.067m0 
and  mb  =(0.067  +  0. 083x)m0, 15  where  m0  is  the  free  elec¬ 
tron  mass,  as  the  electron  effective  mass  at  well  and  bar¬ 
rier  layers.  We  have  assumed  65%:35%  band  offset  ra¬ 
tio1 1,16,17  between  the  conduction  and  valence  band.  Thus 
the  barrier  potential  V  is  given  as  0.65A£?  =  375  meV. 
As  shown  in  Fig.  2,  the  first  three  bands  are  confined  in¬ 
side  the  well  and  the  remaining  two  bands  are  not.  Curve 
la)  in  Fig.  2  corresponds  to  the  function  in  the  left-hand 
side  of  Eqs.  (7)  and  (8)  for  £  <  K  and  £  >  V,  respectively. 
This  curve  meets  the  £  axis  at  31,  268,  and  684  meV. 
Since  Eqs.  (7)  and  (8)  give  the  minimum  energies  of  odd- 
index  bands,  these  three  values  are  the  minimum  energy 
of  the  first,  third,  and  fifth  subband,  respectively.  Curve 

(b)  corresponds  to  the  functions  in  the  left-hand  side  of 
Eqs.  (11)  and  (12).  Obviously,  it  gives  the  maximum  ener¬ 
gy  of  the  first,  third,  and  fifth  subband.  Similarly,  curves 

(c)  and  Id)  correspond  to  the  functions  in  the  left-hand 
side  of  Eqs.  (15)  and  (16),  and  (19)  and  (20),  respectively. 
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FIG.  3.  Envelope  wave  functions  corresponding  to  the  edge 
energies  of  the  first  five  subbands  shown  in  Fig  2.  'P1''*'  is  the 
wave  funclion  corresponding  to  the  maximum  edge  of  ihe  first 
subband,  and  'PC'1"1  is  that  corresponding  to  ihe  minimum  edge  of 
the  first  subband,  etc  6=25  A  is  the  barrier  thickness: 
a  =d  —  b  =  100  A  is  the  well  thickness  The  amplitude  of  the 
wave  functions  has  arbitrary  units. 
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They  give  the  minimum  and  maximum  energies  of  the 
second  and  fourth  subband,  respectively.  All  these  curves 
are  obtained  from  the  new  formalism.  Curve  (e)  is  ob¬ 
tained  from  the  conventional  formalism.  It  corresponds 
to  the  functions  in  the  right-hand  side  of  Eqs.  (3)  and  (4) 
for  £  <  V  and  £  >  V,  respectively.  We  can  easily  identify 
the  first  five  subbands. 

If  we  define  /(£)  as  the  longitudinal  axis  of  the  curves 

(a) -(e),  we  note  that  curve  (e)  meets  the  /(£)  =  1  or  —1 
line  at  exactly  the  same  points  on  the  E  axis  as  the 
minimum  and  maximum  energies  found  from  curves  (a), 

(b) ,  (c),  and  ( d ).  This  indicates  that  the  energy  bands 
found  from  the  new  formalism  are  identical  to  those 
found  from  the  conventional  formalism,  as  they  must  be. 
Even  though  we  have  plotted  complete  curves  of  functions 
in  Eqs.  (7),  (8),  (11),  (12),  (15),  (16),  (19),  and  (20)  to  find 
the  subbands  by  the  new  formalism,  usually  it  is  not 
necessary.  The  energy  bands  can  be  easily  found  by  sim¬ 
ple  numerical  or  graphical  methods.  Of  course  this  for¬ 
malism  can  also  be  used  for  bulk  materials  with  ma=mb. 

In  Fig.  3  the  envelope  wave  functions  are  presented, 
which  correspond  to  the  minimum  and  maximum  ener¬ 
gies  of  the  first  five  subbands  of  the  superlattice  in  Fig.  2. 
These  wave  functions  have  been  found  by  using  the 
minimum  and  maximum  energy  of  each  band,  in  calculat¬ 
ing  a,  /?,  and  6;  also  the  boundary  condition 
'|'a(0)  =  'Pi,(0)  is  applied  to  Eqs.  (5),  (6),  (9),  (10),  (13), 
(14),  (17),  and  (18),  thus  eliminating  A  or  B.  Although 
these  wave  functions  are  not  normalized, 
Pb  =/(,/( /„  +/»),  where  Ia  =  /  g  Va  2dz  and  Ib 
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=  f°-b  !  'f'h  1 2dz>  gives  the  probability  to  find  an  electron 
at  a  barrier  layer.  The  calculated  values  of  Pb  for  this  su¬ 
perlattice  have  been  found  to  be  5.7%,  12%,  25%,  38%, 
and  43%  for  minimum  energy  of  the  first  five  bands,  and 
1.3%,  5.3%,  8%,  13%,  and  18%  for  maximum  energy  of 
the  first  five  bands.  They  are  an  increasing  function  of  n, 
because  electrons  become  more  energetic  with  increasing 
n  and  are  more  probable  to  leak  to  the  barriers.  Pb""  for 
is  larger  than  P™**  for  'PJ1**  of  the  same  band,  be¬ 
cause  'P™n  has  even  parity  and  'P™4*  has  odd  parity. 

We  observe  another  general  property  of  the  envelope 
wave  functions  in  wells  and  barriers  of  superlattices: 
When  E  <  V,  wave  functions  at  the  wells,  corresponding 
to  both  £m, n  and  £ma,  of  the  nth  subband,  have  n  —  1 
zeros  (meet  z  axis  n  —  1  times).  Those  at  the  barriers  have 
0  (for  £m„ )  or  1  (for  £mjn )  zero.  When  £  >  V,  wave 
functions  at  the  wells  still  have  n  —  1  zeros;  those  at  the 
barriers  have  a  different  number  of  zeros  for  different  b, 
ranging  from  0  for  small  b ,  to  infinity  for  infinitely  large 
b.  In  Fig.  3,  however,  we  see  only  0  or  1  zero  inside  the 
barrier,  because  the  barrier  is  sufficiently  thin  to  have  only 
0  or  1  zero. 

In  conclusion,  we  can  calculate  both  the  energy  bands 
and  the  wave  functions  corresponding  to  the  edge  energies 
of  each  band,  of  superlattices  or  bulk  materials,  using  the 
new  formalism  of  the  Kronig-Penney  model.  Moreover, 
the  calculation  of  energy  bands  can  be  done  more  easily 
by  the  new  formalism  than  by  the  conventional  formal¬ 
ism.  The  new  formalism  is  very  useful  for  the  analysis  of 
superlattices  and  application  to  device  designs. 
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ABSTRACT 

Laser  radiation  has  been  used  to  enhance 
both  gas-  and  surface-phase  phenomena 
occurring  in  a  plasma  reactor.  As  a 
result,  resistless  patterning  and 
light-enhanced  plasma  etch  rates  have 
been  demonstrated. 

INTRODUCTION 

In  the  plasma  etching  processes  now  routinely  used  to 
fabricate  semiconductor  devices,  a  resist  mask  is  used  to 
define  an  area  to  be  etched,  and  a  glow  discharge 
generates  reactive  species  which  etch  the  exposed  area. 
Depending  on  the  myriad  process  parameters,  such  as  gas 
components,  gas  pressure,  and  RF  discharge  frequency,  the 
etch  can  have  varying  amounts  of  selectivity,  anisotropy, 
and  damage.  However,  a  trade-off  among  these  factors 
often  exists,  making  it  difficult  to  separately  optimize 
all  of  them.  For  example,  while  increasing  the  ion  flux 
to  the  substrate  can  increase  the  degree  of  anisotropy, 
it  can  also  decrease  the  etch  material  selectivity  and 
increase  the  amount  of  ion  damage.  In  addition,  the 
requirement  of  a  resist  mask  to  define  the  etch  region 
necessitates  many  extra  steps  in  the  processing  sequence. 

Recent  experiments  have  shown  that  light-assisted 
chemistry  can  also  be  used  to  accomplish  dry  etching.  In 
addition,  the  use  of  light  to  alter  the  properties  of 
electric  discharges,  viz,  in  gas  lasers,  and  to  effect 
surface  reactions  has  been  previously  demonstrated.  As  a 
result,  it  is  reasonable  to  expect  that  the  introduction 
of  an  external  light  source  to  a  plasma  reactor  may 
provide  a  useful  tool  for  improving  the  performance  of  a 
plasma  process. 
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In  a  set  of  experiments  which  will  be  described  here, 
we  have  used  laser  light  to  control  the  etching  rate  of 
silicon  and  gallium  arsenide  in  a  plasma  etching 
environment.  Various  light-assisted  processes  have  been 
examined,  including  those  relying  on  gas-  and  surface- 
phase  interactions.  Both  substantial  etch-rate 
enhancements  and  micrometer-scale  maskless  patterning 
have  been  observed. 


CW  LASER-ENHANCED  PLASMA  ETCHING  OF  SILICON 

In  the  study  of  laser  enhanced  plasma  etching  of 
silicon,  cw  laser  light  of  various  wavelengths  and 
intensities  was  weakly  focused  onto  the  surface  of  a 
single-crystal  silicon  sample,  doped  to  varying  degrees. 
The  experiments  are  described  in  detail  elsewhere  (1,2). 
Briefly,  it  was  found  that  at  high  laser  power  densities 
(greater  than  10  kw/cm^ ) ,  there  is  an  etch  enhancement, 
which  appears  to  be  primarily  thermal  in  nature,  and  is 
independent  of  incident  wavelength,  doping  type,  and 
doping  level.  At  low  laser  power  densities  (less  than  10 
kW/cmM  however,  the  effect  appears  to  be  predominately 
a  photochemical  one,  since  the  temperature  rise  due  to 
the  laser  is  quite  low  (less  than  20  °C),  and  the 
effect  is  now  dependent  on  the  wavelength  of  the  light 
and  the  doping  level  of  the  sample. 

The  experimental  results  are  consistent  with  a 
process  that  is  controlled  by  the  flow  of  photogenerated 
carriers  to  the  surface.  Figure  1  demonstrates  the 
photochemical  nature  of  the  etch  rate  enhancement  at  low 
laser  power  densities.  Here  it  can  be  seen  that  for 
heavily  doped  silicon,  either  n-  or  p-type,  the 
photoinduced-etch  enhancement  is  greater  for  the  near-uv 
laser  light  than  for  the  visible  laser  light.  Within  the 
experimental  accuracy  of  this  work  (about  20%),  the 
results  are  the  same  for  either  n-  or  p-type  silicon. 
The  etch  rate  without  laser  enhancement  for  the  heavily 
doped  n-  and  p-type  samples  is  also  approximately  the 
same  for  the  operating  conditions  used  in  this  particular 
reactor . 


This  behavior  is  in  marked  contrast  with  the  results 
of  another  group,  which  studied  purely  photon-enhanced 
etching  of  silicon  by  chlorine  atoms  (3).  In  that 
experiment,  it  was  found  that  the  dark  etch  rate  is  much 
greater  for  n-type  than  p-type  silicon,  but  that  the 
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relative  photon-enhancement  is  much  greater  for  the 
p-type  than  for  the  n-type  material. 

i 

Although  the  precise  mechanism  by  which  the 
photogenerated  carriers  enhance  the  etching  of  silicon  in 
the  CF4/O2  plasma  has  not  yet  been  determined,  there 
are  a  number  of  possibilities.  These  include  increasing 
the  reaction  rate  between  Si  and  SiFx  groups  (as 
pointed  out  by  Houle  (4)  in  another  study  of  purely 


the  reaction  rate  between  Si  and  SiFx  groups  (as 
pointed  out  by  Houle  (4)  in  another  study  of  purely 
light-  induced  etching),  increasing  the  rate  of 
subsurface  fluorination  of  the  silicon,  and  removal  of 
adsorbed  layers,  such  as  hydrocarbons,  on  the  silicon 
surface,  which  may  limit  the  reaction  rate  by  blocking 
surface  sites.  Note  that  all  three  of  these  mechanisms 
result  in  increased  surface  desorption  in  the  illuminated 
region. 


In  order  to  examine  whether  a  more  precisely  defined 
surface  layer  could  be  affected  by  laser  light,  polymer 
deposition  was  induced  by  adding  hydrogen  to  CF4  in  the 
same  etching  reactor.  It  was  found  that  both  the  plasma 
etch  rate  and  the  etch-rate  enhancement  due  to  the  laser 
were  altered. 

First,  Figure  2  shows  the  reduction  of  the  dark  etch 
rate  with  the  addition  of  hydrogen.  It  is  thought  that 
this  reduction  in  etch  rate  is  due  to  the  onset  of  two 
successive  processes;  H  atoms  scavenging  F  atoms  from  the 
system  via  the  formation  of  HF,  a  nonreactive  gas  for 
polymer  etching,  and  polymer  deposition  on  the  surface 
due  to  the  altered  gas-phase  chemistry  (5,6), 
respectively. 

Second,  when  the  laser  is  used  to  illuminate  the 
surface  of  the  silicon  during  the  etch  process,  a  clearly 
defined  enhancement  is  again  observed.  Figure  3  shows 
the  etch-rate  enhancement  due  to  the  laser.  The  most 
prominent  feature  is  the  sharp  rise  in  enhancement  at 
15-mTorr  H2  partial  pressure.  According  to  a  previous 
study,  the  F  concentration  in  the  gas  will  begin  to  rise 
after  its  initial  fall  at  about  this  hydrogen 
concentration  (6).  As  a  result,  the  etch  rate  will  then 
be  limited  only  by  polymer  deposition.  Under  these 
conditions,  the  laser  light  incident  to  the  surface  will 
enhance  the  etch  rate  due  to  the  desorption  of  polymeric 
material.  The  reduction  in  polymer  deposition  due  to  the 
laser  light  can  be  seen  visually  by  a  color  change  in  the 
polymer  layer  in  the  vicinity  of  the  laser  spot,  and 
through  Auger  electron  spectroscopy,  which  shows  a  marked 
reduction  in  polymer  coverage  in  the  illuminated  region. 


An  important  application  of  this  technique  is  that  it' 
can  be  used  to  accomplish  resistless  processing.  Since 
the  strongly  etched  region  is  defined  by  the  laser  light, 
projection  of  a  masked  beam  on  the  surface  directly 
patterns  the  etched  region.  This  form  of  in  situ 
processing  was  demonstrated  by  patterning  the  laser  beam 
with  a  simple  wire  grid.  The  resolution,  which  was 
limited  by  the  demagnification  of  the  projection  lens,, 
was  several  micrometers. 

The  role  of  the  laser  light  in  preventing  polymer 

film  growth  appears  to  again  be  due  to  photogenerated 
carriers  in  the  silicon.  The  low  laser  power  density 

used  r'lO  W/cm2)  would  appear  to  preclude  a  purely 
thermal  effect.  Also,  in  the  same  reactor,  the  laser 
light  does  not  have  any  effect  on  deposition  on  sapphire, 
an  insulator  which  does  not  absorb  light  at  this 
wavelength,  but  would  be  subject  to  any  surface 

photochemical  effects  in  the  same  reactor. 

In  a  second  set  of  experiments,  we  examined  the 
effect  of  laser  light  on  controlling  the  plasma 
discharge,  either  through  its  chemical  species,  or 

through  its  electrical  properties.  These  studies 
involved  laser-enhanced  plasma  etching  of  GaAs.  The 
output  from  a  193-nm  excimer  laser  is  used  to  illuminate 
the  discharge  region  between  two  electrodes  in  a  pulsed 
d.c.  discharge.  Both  the  laser  and  plasma  are  pulsed  at 
a  repetition  rate  of  30  Hz.  The  laser  power  is  typically 
20  mJ/pulse,  and  the  discharge  is  typically  operated  at 
600V  and  a  pulse  width  of  200  usee.  The  gas  used  is 
HBr,  at  a  pressure  of  5  torr  and  a  flow  rate  of  25  seem. 

Previous  work  describes  some  aspects  of  the 
experimental  arrangement  in  more  detail  and  discusses  the 
etch  results  obtained  with  laser  illumination  alone, 
i.e.,  without  a  plasma  discharge  (7).  A  recent  extension 
of  these  purely  laser  studies  has  shown  that  partially 
brominated  surface  layers  play  an  important  role  in  the 
etching  process.  In  particular,  when  the  etched  samples 
are  examined  using  x-ray  photoelectron  spectroscopy 
( XPS ) ,  it  is  found  that  the  chemical  composition  and  the 
depth  of  the  surface  layer  varies  according  to  the 
processing  conditions  employed.  Figure  4  shows  the 
arsenic,  gallium  and  bromine  peaks,  obtained  by  etching 
under  several  different  substrate  temperatures.  The 
exact  energy  of  the  arsenic  and  gallium  peak  depends  on 
the  local  bonding  environment.  For  example,  gallium  from 
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the  substrate  ( GaAs )  can  be  distinguished  from  gallium 
due  to  GaBrj.  However,  the  peaks  due  to  GaBrj  and 
Ga203  can  not  be  resolved  in  this  apparatus.  The | 
same  is  true  for  the  arsenic  peaks.  These  results  are1 
consistent  with  what  has  been  found  by  other  researchers 
in  different  etching  systems.  For  example,  in  fluorine 
etching  of  silicon,  it  has  been  found  that  subsurface) 
fluorination  occurs,  also  to  a  depth  of  about  20  or: 
30  (8) .  i 

j 

Adding  the  laser  illumination  to  the  discharge  region ; 
I  has  several  effects  on  the  etching,  particularly  as  a' 
result  of  phenomena  involving  the  adsorbed  layers. i 
First,  the  etch  rate  is  greatly  increased  over  the  plasma 
etch  rate,  especially  at  temperatures  high  enough  so  that 
the  process  is  not  limited  by  desorption  of  the  etch 
reaction  products,  namely  GaBr3  and  AsBr 3 .  This  is 
shown  in  Figure  5,  where  it  can  be  seen  that  the  laser 
enhances  the  plasma  etch  rate  by  almost  an  order  of 
magnitude  at  100°C.  Note  that  in  this  case,  the  laser 
beam  is  aimed  over  the  substrate  in  the  pulsed  plasma 
discharge.  Calculations  indicate  that  at  the  lower 
temperatures,  where  the  laser  has  little  effect  on  the 
etching,  as  shown  in  Figure  5,  the  etch  rate  is  limited 
by  desorption  of  the  GaBr3  products  from  the  GaAs 
surface,  so  the  creation  of  more  gas-phase  bromine  by  the 
laser  has  little  effect  on  the  etch  rate.  However,  at 
the  higher  temperatures,  where  desorption  is  not  as 
limiting  a  process,  the  laser  has  a  large  effect,  since 
the  bromine  it  creates  can  be  used  effectively  in  the 
etching. 

Second,  etching  with  the  plasma  alone  or  the  laser 
alone  yields  etch  profiles  which  are  quite  different. 
The  etch  profile  obtained  using  just  the  laser  follows 
crystallographic  lines,  while  that  obtained  using  just 
the  plasma  is  more  nearly  vertical.  With  the  laser  and 
plasma  together,  an  intermediate  etch  profile  is  obtained 
which  can  be  more  or  less  anisotropic  depending  on  the 
relative  contribution  of  the  laser  versus  that  of  the 
plasma.  This  behavior  is  due  to  the  laser  changing  the 
density  of  neutral  bromine  atoms  in  the  plasma  region, 
relative  to  the  density  of  ions  created  by  the 
discharge.  While  the  neutral  species  tend  to  cause 
crystallographic  etching,  the  ions  tend  to  cause  etching 
which  is  more  directional. 

Laser  light  can  also  be  used  to  control  the 
electrical  properties  of  the  discharge.  Ultraviolet 
controlled  or  sustained  discharges  have  been  used  to 
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obtain  abnormally  low  ratios  of  electrical  field  to 
neutral  particle  density.  This  low  ratio  can,  for 
example,  yield  a  more  efficient  discharge  for  obtaining 
population  inversion  in  a  laser.  In  this  connection,  we 
have  also  begun  to  investigate  the  use  cf  UV  excimer 
laser  light  to  influence  the  electrical  parameters  in  the 
pulsed,  HBr  discharge  described  in  the  paragraph  above.  j 

In  particular,  preliminary  work  has  been  done  Ln[ 
studying  changes  in  plasma  characteristics  due  to  the 
laser  illumination.  The  technique  used  was  to  irradiate 
the  discharge  region  with  the  laser,  and  at  some  fixed,; 
time  delay  apply  a  voltage  pulse  to  the  discharge 
plates.  The  voltage  applied  is  below  the  threshold  at 
which  the  gas  will  break  down  without  any  laser 
illumination.  Care  was  taken  to  ensure  that  the  laser 
light  did  not  hit  the  electrodes. 

The  first  experiments  showed  that  193-nm  light  was 
effective  in  lowering  “the  breakdown  voltage  in  the 
plasma.  Thus,  if  a  pulse  of  excimer  light  preceded  the 
application  of  the  pulsed  waveform  to  the  discharge 
plate,  the  breakdown  of  the  voltage  could  be  lowered  as 
much  as  50%.  Further,  the  magnitude  of  the  reduction  in 
voltage  depended  on  the  time  between  laser  pulse  and  the 
discharge,  and  the  gas  conditions  in  the  discharge 
region.  For  example,  the  higher  the  HBr  pressure,  the 
longer  the  duration  of  the  interval  of  low 
discharge-breakdown  voltage.  The  details  of  this 
behavior  are  in  accord  with  a  diffusion-limited  loss  of 
the  photogenerated  species.  In  addition,  the 
conductivity  or  carrier  density  in  the  plasma  is  found  to 
be  directly  proportional  to  the  laser  power,  when  the 
193-nm  laser  pulse  and  the  application  of  the  voltage 
pulse  to  the  discharge  plates  are  made  to  coincide.  This 
behavior  is  shown  in  Figure  6,  which  displays  the 
variation  in  discharge  current  with  laser  power,  for  a 
fixed-voltage  pulse.  Both  of  these  experiments 
illustrate  that  UV  light  can  lower  the  operating  voltage 
within  the  plasma,  and  yet  still  obtain  significant 
discharge  current. 

In  summary,  we  have  shown  that  laser  light  can  be 
used  to  control  surface  desorption,  chemical  species,  and 
electrical  parameters  of  a  plasma  reactor.  These  laser 
control  phenomena  have  been  used  to  obtain  the  practical 
advantages  of  maskless  patterning  and  higher  etch  rate 
(at  reduced  driving  voltage)  in  a  plasma  etching 
environment . 
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density  is  approximately  0.05  W/cm*.  The  laser  wavelength  is 
350  nm,  and  the  laser  power  density  is  approximately 
10  W/cm^. 
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Figure  4:  XPS  spectra  of  etched  GaAs  surface,  with 
varying  substrate  temperature  during  the 
etch  process.  The  laser  orientation  is 
indirect . 


PLASMA  CURRENT  VS  LASER  POWER 


o  o  o  o  o  o  o 
<D  in  m  rvj  — 


csiiNn  *8av)  Aviaa  3hii  o  iv  iN3yano 


O  0) 
C  M  O' 

o  «  <0 

C  )  W 
4)  O  f-> 

U  •  0  o 

U  1-4  M  > 
3  —l 
O  *>  3  01 

«  a  o> 
0)  u 
9  ><  U  4 
U  O'  41  SZ 
«  U  «  O 
C  4)  <Q  M 
O  C  rH  -*4 
wo  -v 
0) 

•a  u  sz  v 

0)  w  v 
■C  * 

UH  Uj 

•  o 

0KN 

c  x:  E  O' 
OH  U  C 


X  •  -H 
O'  u  m  O' 
-h  m  •  v 


\v.%V.  .'  y, 


n.1  vr  x.tx*k.t*.th  fv  *\  vv 


251 


OPTICALLY-INDUCED,  ROOM-TEMPERATURE  OXIDATION  OF  CALLIUM  ARSENIDE 

Chien-Fan  Yu,  Michael  T,  Schmidt.  Dragan  V.  Podlesnik,  and  Richard  M.  Osgood 
Jr.,  Microelectronics  Sciences  Laboratories,  Columbia  University,  New  York, 
NY  10027 


ABSTRACT 

Room-temperature,  optically-induced  oxidation  of  the  gallium  arsenide 
surface  has  been  studied  with  laser  radiation  of  different  wavelengths.  It 
was  found  that  deep-ultraviolet  light  is  much  more  effective  in  enhancing 
oxidation  than  near-ultraviolet  or  visible  light.  The  growth  rate  of  the 
oxide  was  also  found  to  be  drastically  increased  by  the  presence  of 
chemisorbed  water  molecules  on  the  surface. 


INTRODUCTION 

The  study  of  optically-induced  oxidation  of  the  gallium  arsenide 
surface  has  been  a  subject  of  intensive  investigationf 1-5].  These  studies 
have  shown  that  simultaneous  exposure  of  a  semiconductor  surface  to  oxygen 
and  to  laser  light  can  stimulate  the  uptake  of  oxygen.  It  is  also  known 
that  a  thin  oxide  layer  formed  on  the  gallium  arsenide  surface  can  be  used 
to  passivate  the  surface  or  to  alter  the  interface  properties[6] .  A  better 
understanding  and  control  of  this  room-temperature  process  can  lend  itself 
to  many  applications  in  the  fabrication  of  semiconductor  devices.  The 
enhancement  of  oxidation  on  the  gallium  arsenide  surface  through  above¬ 
band-gap  photon  illumination  has  been  generally  attributed  to  the  effect  of 
photogenerated  carriers(  1-5 J .  We  report  here  our  recent  study  on  the 
wavelength  dependence  and  the  effect  of  water  molecules  adsorbed  on  the 
gallium  arsenide  surface  in  this  photon-induced  process.  The  use  of  deep 
ultraviolet  light,  below  the  oxygen  dissociation  limit,  and/or  the  presence 
of  chemisorbed  water  molecules  on  the  surface  were  found  to  increase 
significantly  the  growth  rate  of  the  oxide. 


EXPERIMENTAL 

Experiments  were  carried  out  with  either  an  argon-ion  laser  tuned  to 
one  of  its  visible  or  near-uv  lines,  or  with  a  frequency-doubled  argon-ion 
laser  to  generate  257-nm  deep  uv  light.  An  excimer  laser  was  also  used  as 
an  alternative  to  generate  248-nm  (KrF)  light.  In  all  cases  the 
dissociation  limit  of  oxygen  molecules  (242  nm)  was  not  exceeded.  The  laser 
power  was  kept  sufficiently  low  so  that  the  temperature  rise  on  the  sample 
during  exposure  was  negligible.  The  typical  cw  power  density  used  was  50 
mW/cm  . 

The  gallium  arsenide  saijigles  used  in  this  experiment  were  n-type  of 
(100)  orientation  with  —  10  atoms/cmJ  doping  density.  Surfaces  were 
cleaned  by  the  following  steps  before  ex-situ  experiments:  degreasing  in 
warm  trichloroethylene,  immersion  in  acetone,  immersion  in  methanol, 
immersion  in  DI  water,  and  50%  NIl^OU  rinse.  Samples  were  then  blown  dry  and 
attached  to  the  mount  with  silver  paste.  For  in-situ  experiments,  performed 
inside  an  U1IV  chamber,  additional  surface  cleaning  was  done  by  heating  to 
540°C,  followed  by  500  eV  argon-ion  bombardment  and  then  540°C  annealing  for 
5  minutes  to  reduce  sputtering  damage. 

Surface  analysis  was  performed  using  a  multiprobe  surface  spectroscopy 
system  equipped  with  a  concentric  hemispherical  energy  analyzer.  All  XPS 
measurements  were  carried  out  with  an  AlKa  (1486.6  eV)  excitation  source  at 
240  W.  The  system  was  calibrated  from  the  Au  4f ^ ^  level  with  a  binding 
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energy  of  83.8  eV.  The  absence  of  surface  charging  was  determined  by 
observing  the  position  of  the  C  Is  level  at  284.6  eV  from  the  adventitious 
carboy^on  the  sample.  The  pressure  in  the  analysis  chamber  was  kept  below 
5x10  Torr  during  the  measurement. 

RESULTS 

To  review,  in  earlier  studies(7,8]  we  showed  by  using  AES  depth 
profiling  that  the  oxide  layer  formed  during  laser  irradiation  on  the 
gallium  arsenide  surface  immersed  in  pure  water  is  more  than  three  times 
thicker  than  that  made  by  irradiation  in  ambient  air.  However,  while  the 
oxide  is  arsenic  deficient  when  made  in  water,  it  is  nearly  stoichiometric 
when  made  in  air.  In  addition,  the  257-nm  laser  light  exposure  was  found  to 
enhance  the  oxidation  much  more  strongly  than  the  514-nm  laser  light  of  the 
same  power  density. 

The  XPS  spectra  showed  that  the  oxide  layer  contains  both  gallium  oxide 
and  arsenic  oxide.  The  As3d  peak  consists  of  well  resolved  components  from 
the  oxide  and  the  substrate  as  shown  in  Fig.  1. 


BINDING  ENERGY  [eV  ] 

Figure  1.  XPS  spectra  of  the  gallium  arsenide  surfaces 
irradiated  with  248-nm  laser  light  of  3  mj/cm  at  30  llz 
in  laboratory  air  ambient  for  a)  10  min,  b)  20  min,  c) 
32  min,  and  d)  40  min. 
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The  thickness  of  the  oxide  layer  can  be  estimated  using  the  relative 
intensities  of  the  oxide  and  substrate  of  tne  same  emission  level  in  the  XPS 
spectrum  as  shown  in  the  following  equation[9): 


I  D  .  A  . 

*sin0  *ln(-0*-  *  -sub  *  -5ub  +  1) 


where  I  ^  and  I  ^  are  the  intensities  of  the  photoelectrons  from  the  same 
atomic  fevel,  DoSuand  D  ^  the  densities  of  oxide  and  substrate,  and 

1  ,  the  electron  mlan  escape  depths  of  oxide  and  substrate,  dox  the 
thickness  of  the  oxide  layer,  and  8  the  exit  angle  of  the  photoelectrons 
relative  to  thr  surface.  The  electron  mean  escape  depths  in  gallium  arsenide 
have  been  determined! 10]  to  be  25  A  for  the  As3d  level  and  7  A  for  the 
As2p^,9  level.  We  thus  determinedothe  thickness  of  the  oxide  layers  shown 
in  Fig.  1  to  be  13,  17,  21,  and  26  A,  made  by  exposure  of  GaAs  surfaces  to 
248-nm  laser  light  of  3  mJ/cm  at  30  Hz  for  10,  20,  32,  and  40  minutes, 
respectively.  In  the  control  sample,  6  1  oxide  was  found  to  have  developed 
through  the  handling  in  air  after  the  chemical  cleaning  procedures. 
Subtracting  this  native  oxide  thickness  from  each  cf  the  above  thicknesses, 
a  linear  oxide  growth  rate  of  about  0.5  X/min  can  be  seen.  A  more  extensive 
study  of  the  wavelength  dependence  was  also  made.  As  shown  in  Fig.  2, 
exposure  of  the  deep  ultraviolet  laser  light,  either  257-nm  (cw)  or  248-nm 
(pulsed),  can  make  much  more  oxide  than  that  of  the  near  ultraviolet  or 
visible  laser  light  of  about  the  same  power  intensity  (ca.  50  mW/cin  ).  Note 
the  latter  longer  wavelengths  induce  only  a  small  amount  of  additional  oxide 
on  top  of  the  native  oxide  shown  in  the  open  circle. 


WAVELENGTH  (nm) 
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257  248 


PHOTON  ENERGY  (eV) 

Figure  2.  Oxide  layer  thickness  as  a  function  of  th 
wavelength  of  the  laser  light  irradiated  at  50  mW/cm 
for  20  minutes  each  in  laboratory  air  ambient. 
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Figure  3.  XPS  spectra  of  gallium  arsenide  surfaces 
irradiated  with  248-nm  laser  light  of  2  mj/cm  at  100  llz 
for  20  minutes  in  dry  oxygen  (lower  panel)  and  in  oxygen 
with  equilibrium  water  vapor  (upper  panel). 

Another  set  of  experiments  were  done  in  the  well-controlled  gas  ambient 
inside  an  UHV  chamber.  The  248-nm  light  was  introduced  through  a  suprasil 
window  onto  the  sputter-cleaned  and  annealed  surface  inside  the  chamber.  A 
drastic  difference  was  found  depending  on  whether  this  chamber  was  filled 
either  with  dry  oxygen  or  with  oxygen  bubbled  through  pijre  water.  Results 
are  shown  in  Fig.  3  for  20  minute  exposure  of  200  mW/cm  power  density.  In 
the  lower  panel  the  oxide  grown  in  dry  oxygen  is  shown  to  be  only  about  9  X 
in  thickness,  while  in  the  upper  panel  the  oxide  grown  in  water-vapor- 
saturated  oxygen  is  shown  to  be  about  31  X  in  thickness.  In  a  separate 
control  experiment,  oxygen-free  water  vapor  (obtained  by  prolonged  bubbling 
with  nitrogen)  was  let  into  the  chamber  up  to  its  equilibrium  vapor  pressure 
(  %  20  torr).  Subsequent  irradiation  of  the  surface  with  248-nm  light 
produced  no  appreciable  oxide. 

DISCUSSION 

We  have  not  yet  developed  a  complete  understanding  about  the  role  the 
carriers  play  in  this  optically-induced  oxidation  process.  However, 
unrelaxed  carriers  generated  by  the  deep-uv  light  illumination  are  more 
energetic  than  those  generated  by  the  near-uv  or  visible  light.  This  may 
explain,  at  least  in  part,  the  strong  wavelength  dependence  of  the  oxidation 
process. 
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Moreover,  based  on  the  previous  understanding  of  GaAs  surface 
oxidation,  we  can  develop  a  plausible  explanation  for  the  results  of  the 
enhanced  oxidation  through  the  presence  of  water  adsorbates.  It  has  been 
reported  that  the  adsorption  of  water  molecules  on  the  gallium  arsenide 
surface  at  room  temperature  will  lead  to  a  chemisorbed  adlayer  on  the 
surfacef 1 1 , 12  ] .  This  may  then  be  followed  by  the  dissociation  of  the  water 
molecule  into  chemically  bonded  -OH  and  -H  to  surface  gallium  atoms[12). 
The  oxidation  of  gallium  arsenide  is  believed  to  proceed  with  the  initial 
attachment  of  oxygen  to  arsenic  atoms[5,13].  We  postulate  that  the 
chemisorption  of  water  molecules  on  the  surface  will  weaken  the  adjacent  Ga¬ 
As  bonds  and  make  the  arsenic  atoms  more  susceptible  for  chemisorption  of 
oxygen  molecules.  Photogenerated  carriers  will  subsequently  induce  the 
breaking  of  the  molecular  oxygen  bonds  and  complete  the  oxidation  process. 
Thus,  a  much  more  rapid  growth  of  oxide  is  observed  with  the  presence  of 
water  vapor. 
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Abstract 

Tne  use  of  as«r  ass - sted-chem I stry  for  dry  etching  of  electronic  materials  'S  described 
£r*c>r  a  s  ■  s  is  p'aceo  or  the  use  of  i  ase ' -ass  i  sted  reactions  for  large  area  processing  Review 
of  the  current  technology  s  given  for  large  area  masked  etching,  UV-proj ect ■ on  etching,  ana 
'ase'  ass  stea  reactive  iO"  etching  (RIE),  snd  plasma  etching 

IntroduCt i on 


Many  new  : a s e r - c " e w  ■ c a '  te:kr,a.ej  <c'  ory  etching  have  bee'  ae v e ■ oped • f o'  Detent 'a 
acC  'CatiOn  I n  m i c 'oe ' ee t ron . c  fao'cat«on  The  applications  range  from  h  .  gh  resolution, 
planar,  UV-proj ect . on  pattern  transfer  to  maskless  fabrication  of  three  d'*enjiona  mcro- 
st'uctures  Most  of  these  techr.ques  have  ■  "  common  the  use  of  I ase r - i nouceo  o'  -enhanced 
ch«m,cal  reactions  at  a  soi’d-gas  o'  sol id-i 'Ouid  interface.  In  th . s  paper,  we  will  review 
the  status  cf  large  area  i aser-ass ■ sted  etching  of  materials  for  solid  state  electronics 
with  emphasis  on  gas  (dry)  amoe"ts  The  reader  interested  in  "direct  write"  etching  and 
basic  mechanisms  involved  in  laser  assisted  etching  is  referred  to  several  recent  reviews  1 

The  features  wh . ch  make  I ase'-ass . sted  etching  attractive  from  a  processing  perspective 
a'e  the  fast  etch  rates,  specific  chemistry,  low  substrate  temperatures ,  high  materia' 
seect'vty.  a'd  etc"  an.sot'opy  These  featu'es  are  a  resu 1 t  of  t"e  nature  of  the  lase'- 
initiatM  chem  st'y  involved  Specifically,  't  is  possible  to  selectively  dr.ve  chem.cai 
react'O's  a'ong  predeterm  ned  pathways  using  laser  eic ' tat'on  of  spec > f  ■  c  reaction  modes 
Por  ensmc  e.  us  ng  i  a s e r - . no u c ed  cnem.stry,  it  may  be  possible  to  produce  CF)  and'  Br  reac¬ 
tants  f'om  C F | B r  w  thout  the  concomitant  generation  of  other  species  (CF,  C,  CF,’,  etc  ). 
Such  as  those  produced  >n  p'asma  d  scharges  2  In  of-e"  forms  of  'ase'  p'oeess  ng.  e'er  ct 
reactions  are  d'  «e'  py  'ap'd  t'ansent  heating  of  tne  Suost'ate  oy  a  pulsed  ase'  By 
us  ng  a  snert-pu  i  sed  Hr  with  a  sma 1  I  surface  penet'aton  decth,  the  heat  ‘S  cor'('~ea  only 
to  t"e  su'face  Co'secu«"t i y ,  h . gn  temperatures  can  pe  reached  'r  the  nea'-su'face  reg  on 
<’  e  t'e  r»*i  nae-  0f  tne  samp , e  remains  at  the  ampient  tempe'ature  In  adat'O'.  the 
opt  ca  character  st  cs  of  the  laser  beam  itself  can  g  ve  the  etc'ing  process  oes ■ 'ab  e 
p-'ooert'es  Fo'  e-amp'e,  since  the  laser  beam  has  a  well  defined  direction,  h.ghiy 
an i sc  t  r  op  *  c  featu'es  car  be  produced  Finally,  focusing  or  imaging  the  laser  beam  enab  es 
unique  maskless  pattern  gene'at'On  on  surfaces  by  tne  techniques  of  "direct  writing"  or 
projection  pattern , ng,  respectively 

We  w.il  first  b''efiy  review  t”e  genera'  f  •  e  !  d  of  Itser-iss  stta  etching  of  e 1 ect 'on  c 
materials  and  then  review  the  cument  status  of  large  area  masked  laser  etching,  i aser- 
asS'Sted  project  on  etching  and  I ase r-enhanc ed  RIE  and  plasma  etching 

An  Overview  pf  Chem  ca'  Approaches  to  Lase'  Etching 

Cnem  ca1  approaches  to  i ase '- ass ■ sted  etching  of  electronic  materials  can  be  t'eated  •" 
terms  of  four  genera'  mechanisms  acco'd'ng  to  the  involvement  of  the  laser  i ,gnt  .n  seve'a 
basic  so  i  i d- su r f ace  and  gas  phase  molecular  processes  These  mechanisms  a'e  photochemica 
the'mai,  photon-act i vated  surface,  a no  aotat>ve  For  eiampie,  >n  photon-ass  sted  dry 
etch  ng  the  process  'S  dominated  by  photo"  interactions  with  gas-phase  or  su 'f ac e- adsorbed 
soec  i  es  ,  while  .n  direct  ablative  photoetching  photon-su r f ace  processes  dominate  the 
mate'-a1  remova  Table  I  categories  d'y  laser  etcn.ng  processes  ■"  te-ms  of  t"ese  *ou' 
mechan  sms 
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In  tha  cast  of  photochemical  dry  etching,  an  inert  parent  molecule  in  the  vicinity  of  a 
gas-solid  interface  is  dissociated  by  one  of  several  gas  or  adsorbed  phase  processes  to  pro¬ 
duct  a  reactive  atom  or  free  radicel .  This  species  reacts  with  the  surface  to  form  a  vola¬ 
tile  compound  which  then  leaves  the  surface,  thereby  resulting  in  etching  of  the  solid  In 
the  simplest  case,  sbsorption  of  a  single  UV  photon  resu I ts  in  dissociation  of  the  parent 
molecule  to  produce  reactants.  Alternatively,  multiphoton  encitation  by  a  pulsed  CQ}  laser 
has  been  used  to  highly  eneite  and/or  dissociate  the  gas  phase  parent  molecules.1* 

For  direct  ablative  photoetching,  the  laser  wavelength  is  selected  such  that  the  light  is 
highly  absorbed  in  the  surface  layer  of  the  substrate  Us i ng  lasers  which  deliver  pulses 
with  high  peak  powers,  the  surface  layer  is  removed  by  simple  evaporation  This  approach 
has  been  applied  to  a  wide  variety  of  inorganic  materials.*  In  a  novel  variation  of  this 
approach,  Koran  et  el.*  have  etched  thin  Ai  films  which  are  prereacted  with  molecular  chlo¬ 
rine  to  produce  AlCI,.  The  surface  layer  was  then  removed  by  XaC I  (308  nm)  I aser- i nduced 
evaporation.  This  procedure  has  been  used  to  etch  AI  films  at  a  rate  of  1  pm/pulse.  On  the 
Other  hand,  for  etching  of  polymers  with  UV  lasers,  e<plosive  removal  of  Surface  layers  is 
believed  to  be  caused  by  bond  cleavaga  photochemistry  in  the  surface  molecules  due  to  direct 
polymer  electronic  transitions  * 

For  etching  sem i conductor  substrates  irradiated  with  low  laser  powers  in  the  presence  of 
a  reactive  ambient,  e  i  ectron-ho i #  pair*  created  in  the  bulk  by  absorption  of  above  band  gap 
laser  light  in  the  bulk  have  been  shown  to  be  responsible  for  accelerated  desorption  and 
reactions  at  the  sem conductor  surface.*  At  h.gher  intensities,  thermal  e»citation  of  the 
sol  id  becomes  important.  The  heating  results  from  a  number  of  phenomena,  including  no»- 
radative  recomb ! nat ■ on  and  energy  transfer  from  hot  electrons.  The  thermal  esc i tat  I  on  i s 
needed  to  drive  chemical  reactions  over  an  energy  threshold  on  the  surface  7  Parent  mole¬ 
cules  are  then  selected  which  will  on l y  react  with  the  local  heated  surface. 


UV-In ' t i ated  Dry  Etching  of  Masked  Substrates 


Many  sem i conductors  materials  such  as  GaAs  are  particularly  difficult  to  etch  in  c'»s*i 
env i rp-iments  because  of  the  surface  damage  created  by  energetic  charged  particles  Th  , %  is 
a  1  so  true  of  etching  of  substrates  having  thin  oside  layers  or  structures  In  order  to 
overcome  these  problems  while  keeping  the  desirable  anisotropic  cha racter . st ■ cs  c{  p;is«i 
etching,  UV- . n i t ' ated  dry  etching  .s  being  developed  as  a  low  energy,  neutral  charge  t»ch- 
nuaue  having  applications  to  large  area  processing,  E * c i me r  laser  dry  etching  has  oeen 
demonstrated  for  the  masked  etching  of  GaAs,  poiy-Si  and  S'  *"11  In  tha  case  of  GaAs , * ■ *  an 
ArF  «»o»tr  laser  has  been  employed  to  effect  Single  photon  dissociation  of  e.ther  CF,Br. 

CH j 0 r ,  o r  H0r  parent  gases  to  produce  primarily  Br  atom  etchants.  For  both  poiy-S'  and  S', 
m  i'v'is  of  the  etchant  parent  gas  and  a  polymerizing  agent  were  employed  to  etch  the  sub- 
st-ate  wr  .  i e  producing  an ■ sotrop 1 c  S'de-waii  features. 


Figure  1  snows  the  etch  rate  of  GaAs  (ZOO)  as  a  function  of  ArF  laser  energy  arS'ty  at 
two  temperatures  when  HB'  's  used  as  the  parent  gas  Masking  of  the  GaAs  subst-ac.es  was 
■»:compi ■ sneo  us  ng  photoresists  which  do  not  ablate  during  the  e«C’mer  lase-  Du  se  Lne- 
•'dth  resout'or  be ' ow  1  (if  has  been  demonstrated  '  Etch  rates  of  up  to  8  Mm/min  * , tn  large 
area  substrates  heated  tn  60*C  were  obtained  using  moderate  »s«'  inters, tes  (<35  "J/cr1)  * 
Seiectve  crystal lographic  etching  was  observed  (Figure  2)  and  etch  rates  fcr  the  lower 
orae'  cr,vsta;  Canes  were  measured  to  be  <  1 1 1  }B>{  100) >{  1 10} >{  11 1  >*  The  etching  process 
seems  to  be  dom , rated  by  reactions  between  Br  raccals  and  the  GaAs  Surface,  etc-'ng  prod¬ 
ucts  are  pr,mar>iy  the  bromides  of  gallium  ano  arsenic  I"  these  e»per.ments.  no  e>  Pence 
of  As  depletion  of  the  surface  layer  has  been  ooserved 


For  the  masked  e»c I »er  laser  initiated  etchi ng  of  po  t  y  —  S • 1  *  and  Si,11  Hor iii<e  and  co- 
worktrj  at  the  Toshiba  Coro  have  developed  a  novel  se i f -mask i ng  technique  to  confo  S'de- 
wa I  I  profiles  during  processing  py  using  etchsnt  gas/monome r  mixtures  irradiated  with  an 
e«cmer  laser.  In  their  initial  work,1*  a  XeC  I  e*cimer  laser  (308  nm)  is  used  to  pncto- 
dissociate  a  ml*ture  of  Cij  and  Si(CH,)4.  Tha  Cl  atoms  generated  by  the  photod ' ssoc at ' on 
process  served  both  to  etch  the  substrate  and  to  cause  free  radical  po I yme r i xat 1 o"  o*  the 
S' (CH,)4  monomer  Po l ym#r i zat ' on  produces  s  nonvolatile  poly-methylated  S'  surface  film 
Direct  illum.nat'on  by  the  XeC I  laser  normal  to  the  substrate  causes  ablation  of  the  depos¬ 
ited  film  Th i s  results  in  an i sotrop ' c  etching  since  the  sidewalls  (parallel  to  the  >ase- 
Omani  remain  c'ptecfed  d.-'-o  etch,--  T-e  chip-  n.  -hem  stry  produces  se'ect've  etcn.-g  c ‘ 
the  S'  or  boiy-S*  reiat've  tc  the  5'0j  masking  materia'  As  a  result,  the  process  <s  pcte-- 

t'a’iy  usefui  for  the  etcn.ng  sub* ' c rometer  features  with  very  thin  gate  ox. das  Note, 
however,  that  care  must  be  taken  with  UV  i  i  luffiinit'O0  to  prevent  photogene  ra  ted  f  ro* 

b« ■ Ag  produced  under  the  gate  o««de  An  alternative  approach  to  t^ < *  techn.oue  *•*  been 
reported  11  In  these  e»per « ,  a  *'C',o*ave  discharge  was  used  to  produce  the  etchant  C) 
'•ad-cais,  and  me thy i -methac r> i ate  (MMAj  was  used  as  the  polymer  precursor  These  changes 
enabled  the  d^coup1  •  ng  of  the  etchant  production  from  the  ablation  of  the  po 1  yrr*  r ,  zees  sur¬ 
face  films  The  implementation  of  th,s  techno i ogy  in  the  f ibr i cat i on  of  a  1  Mb • t  DRAM  ce 
has  been  reported  1 1 
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EXCIMER  LASER  (193nm,  30Hx) 


O  •  Sub*lrote  T*mp  60*C 
x  ■  Sub*tfOt*  Ttmp  23*C 


Projection  etching  has  significant  advantages  for  the  fabrication  of  * ■ croe ■ ectron i c 
circuits  since  the  character i st 1 cs  of  laser  processing  are  combined  in  both  a  paral le1  and  a 

direct  pattern  transfer  technique.  Pattern  transfer  has  been  accomplished  by  imaging  a 

homogenized  laser  beam  through  a  photomask  onto  a  solid  surface  to  produce  patterned  mcro- 
chemicel  reactions  This  scheme  has  been  applied  to  the  etching  of  pyrex  glass,12  GrAs,11 
and  poly-Si,11,14  although  recently  Surface  product  ablative  patterning  of  Al  films*  has 

been  demonstrated  in  a  manner  similar  to  that  used  for  etching  polymers  *  In  work  on  Mo  and 

poly-Si,14  radiation  present  in  near-field  diffraction  patterns  has  been  used  to  photochem¬ 
ical  ly  initiate  etching  reactions  without  substrate  masking.  A  resolution  of  <0.30  (i*  was 
achieved  using  this  process  In  all  of  these  cases,  excimer  lasers  have  been  used  as  the 
illumination  source.  The  primary  advantages  of  these  sources  are  their  short  wavelengths 
and  high  average  and  peak  powers  Relatively  simple  optical  imaging  systems  have  been 
employed  with  typical  experimental  setups  using  standard  photomasks  and  1:1  imaging  lenses 


For  higher  resolution  reflective 
achieve  etch  features  of  <1  m m. 


icroscope  systems  (35.1  reduction)  have  been  used1 


For  the  projection  etching  of  CaAs , 1  *  poly-Si,11,14  and  Mo,14  a  gas-phase  photochem¬ 
ical  ly  Initiated  process  relying  on  the  dissociation  of  ha  I ogen-bea r i ng  parent  gases  was 
employed.  The  physical  cons i derat i ons  in  these  experiments  are  different  from  those  of 
masked  substrate  etching  since  parent  gas  pressure,  surface  temperature,  and  laser  intensity 
al I  affect  the  ultimate  image  resolution  through  gas  or  surface-mediated  diffusion  In  the 
case  of  CaAs,  the  addition  of  reactive  and  inert  buffer  gases11  have  been  found  to  be  useful 
in  controlling  the  etch  linewidth  (Figure  3);  however,  this  results  in  lower  etch  rates  In 
these  experiments,  the  confinement  of  the  etching  gas  diffusion  appears  to  stem  from  tne 
reduction  of  the  mean  free  path  by  physical  col  I  isions  or  chemical  reactions.  However, 
compared  to  theoretical  fits  to  the  experimental  parameters,  the  etched  linew.dths  have  been 
found  to  be  sma i ler  by  a  factor  of  5.  The  order  of  the  reaction  of  the  B r  atoms  with  the 
GaAs  Surface  has  been  useo  to  explain  th . s  large  decrease  in  linewidth  Physca;iy,  tn  .  s 
can  be  understood  in  terms  of  a  concentration  I  ifetime  due  to  the  non  I  i ntor  order  of  the 
surface  reaction  which  limits  the  etching  contribution  of  the  initial  projected  reagent  flux 
to  less  than  one  mea'  free  path  above  the  surface  11  In  other  words,  the  'act  that  the 
etching  is  nonl.near  in  reactant  flux  means  that  the  Surface  regions  which  encounter  a  high 
atom  flux  are  etched  much  faster  than  the  low  concentration  regions.  The  result  of  th . $ 
effect  s  that  the  etched  image  more  closely  reproduces  the  initial  projected  image 


SUfff  A  GAS  hSESSUAf  Tori 


Tgure  3  Lmewidth  of  etched  grooves  are  plotted  as  a  function  of  buffer  gas  pressure 
T»o  physical  quenchers  (Ar*0  and  H2*fi)  and  a  chemical  scavenger  (ethylene  D) 
were  m.xed  with  the  HBr  parent  gas  The  optical  feature  s>ze  of  the  projected 
i  .  ne  image  was  3  5  Mm. 


In  oth«r  projection  work,  waveguided  aqueous  etchino  of  GaAs  has  been  extended  to  be  a 
parallel  process  by  the  use  of  a  UV  projection  scheme.  *  In  these  experiments,  the  light-o 
a  KrF  excimer  laser  (248  nm)  is  projected  onto  a  GaAs  substrate  which  is  placed  in  a  quartz 
cell  containing  a  1  to  51  HNO,  solution.  An  array  of  high  aspect  ratio  holes  (4  pm  diam¬ 
eter)  have  been  produced  without  masking  the  substrate  as  shown  in  Figure  4. 
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Figure  4  Seaming  electron  micrograph  of  I aser- i nduced  wet  etched  CaAs  (100)  using  5% 


and  a  KrF  e>ci*er  lase’ 


Projection  imaging  by  4  1  optical  reduction  was 


employed  to  produce  these  high  resolution  holes  (4  mm  diameter). 

The  above  approaches  are  in  contrast  to  the  direct  physical  ablative  etching  of  thin 
polymer*  and  Al*  films  where  the  laser  ,s  not  used  to  initiate  external  chemistry  It  is 
thought  that  in  the  case  of  polymer  films  at  low  laser  powers  that  electronic  transitions  ir 
the  surface  layer  control  the  etching  For  organic  resists,*  experiments  have  demonstrated 
<1  pm  reso'ut'on  In  add i t i on ,  ablation  has  been  achieved  at  laser  energy  densities  as  low 
as  10  mj/cm2  For  ablative  etching  of  Al  films,  the  surfaces  are  exposed  to  molecular 
chlorine  to  form  an  A ! C I j  outer  layer  which  is  ablatively  desorbed  using  a  0  25  to  3  J/cm2 
XeC l  excimer  pu  ■  se  Etcn  rates  up  to  1  pm/ pulse  have  been  acn.eveo  in  these  exp«-lrie"t$ 

This  approach  is  valuable  for  some  materials;  however,  for  electronical ly  sensitive  mater,- 
a  s.  such  as  compou-io  sen i conductors .  d  r,ct  physical  ab i at ' on  would  be  deleterious  to  the 
surface  stoichiometry  due  to  Surface  heating. 

Lase-  Enhanced  RIE  and  P ; asma  Etchrg 

The  use  of  'asers  In  conjunction  with  RIE  and  plasma  etching  reactors  has  only  recently 
been  reported  181  ■'*  In  one  approach,  a  port,  in  the  top  electrode  of  a  convent'ona1  reac- 


'  s  used  to 


’t'oduce  the  laser  light,  which  can  be  focused 


or  projected*  by  an 


external  lens  system  onto  the  sem i conductor  surface  Localized  patterns  can  be  etched  <« 
the  area  illuminated  by  the  laser  With  the  use  of  projection  etching,  mode'ate  image  reso¬ 
lution  <io  pm  has  been  acn.eved  18  The  limitation  in  feature  size  wh.ch  has  been  achieved 
■n  this  work  is  due  to  the  simple  lens  system  employed  A  second  approach  uses  the  laser 
i ight  to  alter  the  external  gas-phase  chemistry  above  the  sem , conductor  18  This  al lows  the 
separate  control  c*  the  production  of  gas-phase  neut'ai  chem.cai  species  and  >on  bombardment 
in  the  plasma  etcher  In  these  experiments,  the  output  of  a  193  nm  ArF  excimer  laser  was 
directed  over  a  GaAs  sample  in  a  pu 1 sed  discharge  of  Ar  and  H8 r 

There  are  several  potential  advantages  to  using  I aser-contrc I  I  ed  chemistry  In  conjunction 
with  a  plasma  reactor  First,  the  light  beam  can  be  used  to  define  a  region  to  be  etched 
without  the  necessity  of  prior  surface  masking  Second,  light  can  enhance  the  etch  rate  of 
materials  With  otherwise  slow  etching  rates  in  conventional  reactors  In  addition,  light- 
induced  desorption  may  reduce  the  number  of  surf  ace-adsorbed  product  molecules  without 
requiring  intense  phys i ca I  Sputtering 

The  mech  an  isms  in  the  direct  surface  excitation  orientation  have  been  shown  to  fail  >  n to 
two  classes,  namely  thermal  ano  northermai  No"thermj i  effects  observed  at  low  laser  inten¬ 
sities  have  been  shown ,  in  the  case  of  S'  I  ' con ,  to  depend  on  dopant  concentrations  and  'ase- 
wavelength  18  The  doc ■ ng  dependence  suggests  that  the  photo-enhancement  it  related  to  the 
production  of  photegenerated  carr  erj  It,  has  been  suggested  that  a  flow  of  minority  car¬ 
riers  to  the  surface  reaction  sites  gves  rise  to  enhanced  surface  reaction  rates  Th , s  has 


'  v*  w>  o'  iTT.’Yyyy 


also  been  observed  in  laaar  enhanced  itchi^a  of  Si  in  XeF}  (no  plasma)*  and  in  »i«ct>-od«- 
I  ass  I  aser- i nduced  wet  etching  of  Ca As.1*  At  higher  laser  intensities,  the  local  temper¬ 
ature  may  reach  several  hundred  degrees  at  the  center  of  the  laser  focal  spot  on  the  wafer 
depending  on  the  focal  spot  size  and  the  laser  power.  Etch  rates  of  25  times  above  the 
dark  etch  rate  for  Si1*  and  10  times  for  GaAs17  have  been  obtained  by  localized  heating  of 
the  sen i conductor  surface.  This  enhancement  may  result  from  product  desorption,  enhanced 
rates,  or  removing  adlayers  which  block  the  surface  reaction  sites. 


In  the  case  of  gas  phase  e»citation  by  a  193  nm  enciner  laser  in  pulsed  HBr/Ar  dis¬ 
charges,1*  the  densities  of  neutral  species  was  controlled  independently  from  the  i on 
bardment.  Optical  emission  studies  showed  the  increased  production  of  reactive  neutra 
radical  species  An  added  advantage  observed  using  this  combination  was  that  the  plasma 
breakdown  vol  ’.age  was  lower  when  the  laser  pulse  was  synchronized  to  the  discharge  pulse 
This  dependence  also  yielded  lifetime  information  about  the  I aser-produced  radicals.  Sub¬ 
stantial  changes  in  the  etching  characteristics  due  to  the  laser  chemistry  were  also 
observed  with  m i crostructure  wall  definition  changing  from  vertical  to  C rysta I  I ograph i c 


on  bom - 


Cone  I  us i ons 


Laser-assisted  etching  has  been  shown  to  be  a  powerful  method  for  processing  electronic 
materials  in  regimes  where  plasma  or  RIE  techniques  fail.  The  most  attractive  features  o( 

•  aser-ass  i  sted  etching  include:  maskless  processing,  etching  with  neutral  species,  and  pro¬ 
duction  of  an ■ sotroo  *  c  features.  For  most  materials,  laser  etching  has  been  shown  to  nave 
Superior  eten  rates  and  material  selectivity  relative  to  plasma  etching.  Resolution  of 
below  1  n m  in  both  masked  and  projection  laser  etching  has  been  demonstrated  and  has  been 
found  to  be  competitive  with  most  other  current  processing  techn.ques.  The  most  important 
limitation  to  date  >s  the  need  for  more  reliable,  high  power  UV  lasers. 
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DIRECT  CONNECTION  OF  OPTICAL  FIBERS  TO  INTEGRATED  CIRCUITS 
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S.R.  Fossum,  P.R.  Prucnal,  and  R.M.  Osgood 

Department  of  Electrical  Engineering  and 
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1312  S.W.  Mudd  Building 
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(212)  280-3107 

ABSTRACT 

We  have  been  investigating  novel  structures  for  coupling  optical 
fibers  directly  to  integrated  circuits.  The  coupler  features  a  small 
footprint,  reliable  alignment,  and  mechanical  stability,  and  is 
suitable  for  high  density  optical  fiber  interconnects.  At  the 
present  time,  we  have  demonstrated  a  high  quality  (silicon)  receiver 
structure  fabricated  using  these  techniques.  The  receiver  consists 
of  a  vertical  anisotropic  cavity,  formed  using  a  laser-assisted 
etching  process,  which  is  then  doped  to  form  a  P-N  junction 
detector.  A  tapered  optical  fiber  is  inserted  into  the  cavity  and 
affixed  in  place  by  means  of  a  U.V.  cured  adhesive.  The  performance 
of  this  receiver  is  significantly  improved  over  that  which  we 
reported  earlier  this  year1.  The  detector  responsivity  is  0.17  A/w 
(632.8  run)  and  the  dark  current  is  less  than  0.5  nA.  We  are 
presently  making  switching  speed  measurements,  but  the  performance  is 
expected  to  be  comparable  to  other  silicon  P-N  junction  detectors. 

The  paper  will  briefly  discuss  the  fabrication  process  of  both  the 
receiver  and  the  optical  fiber.  The  performance  properties  of  the 
receiver  as  obtained  at  the  time  of  the  conference  will  be  reported. 
We  will  also  report  our  progress  on  backside  interconnect  structures, 
III-V  device  interconnects,  and  optical  source  structures,  which  are 
currently  under  investigation.  Finally,  the  paper  will  address  the 
problems  we  have  encountered  in  mounting  and  packaging  this  device, 
and  the  implications  of  this  sort  of  optical  interconnection  for  the 
future . 

1.  P.R.  Prucnal,  E.R.  Fossum,  and  R.M.  Osgood,  "Integrated 
Fiber-Optic  Coupler  for  Very  Large  Scale  Integration 
Interconnects",  Opt.  Lett.,  11,  109,  1986. 


Wavelength  dependence  of  optically  Induced  oxidation  of  GaAs(IOO) 
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The  wavelength  dependence  of  optically  induced,  room-temperature  oxidation  of  GaAs  ( 100)  is 
reported.  The  sample  illumination  was  performed  with  either  continuous  wave  (cw)  or  pulsed- 
laser  sources  radiating  at  different  wavelengths,  in  the  range  of  248-514  nm.  The  thickness  of 
optically  induced  oxides,  ranging  from  1  to  >  50  A,  was  measured  with  x-ray  photoelectron 
spectroscopy.  It  was  found  that  the  oxidation  is  always  much  more  rapid  in  deep-ultraviolet  than 
in  near-ultraviolet  or  visible  wavelength  regions.  This  effect  is  attributed  to  the  generation  of  hot 
carriers  at  the  semiconductor  surface  under  ultraviolet  light. 


I.  INTRODUCTION 

The  enhancement  of  chemical  reactions  on  semiconductor 
surfaces  by  illumination  with  photons  is  well  known.  In 
many  cases,  this  enhancement  occurs  as  a  result  of  photo¬ 
generation  of  electron-hole  pairs,  with  a  subsequent  step  by 
one  of  the  carriers  leading  to  or  catalyzing  a  chemical  reac¬ 
tion  at  the  semiconductor  surface.1-3 

Such  reactions  are  of  practical  interest  because  of  the  po¬ 
tential  for  utility  in  the  fabrication  of  semiconductor  devices. 
For  example.  III— V  and  II— VI  semiconductors  often  require 
processes  that  accelerate  chemical  reactions  while  keeping 
the  semiconductor  near  room  temperature.  It  has  not  been 
generally  appreciated,  however,  that  carrier-related  reac¬ 
tions  exhibit  strong-band-structure  dependence  when  differ¬ 
ent  photon  energies  are  used.  In  GaAs  crystals,  for  example, 
visible  and  ultraviolet  photons  are  absorbed  in  different 
parts  of  the  Brillouin  zone,  thus  generating  electrons  and 
holes  of  different  energies.  In  this  paper,  we  will  present  the 
results  of  a  study  of  the  light-enhanced  oxidation  of  GaAs.  It 
will  be  shown  that  the  optical  properties  of  GaAs  at  deep 
ultraviolet  wavelengths  lead  to  more  rapid  oxidation  reac¬ 
tions  than  those  seen  in  visible  wavelengths.  The  enhanced 
ultraviolet  oxidation  was  then  used  to  grow  relatively  thick 
oxides  (  >  50  A )  at  room  temperature. 

II.  OPTICAL  PROPERTIES  OF  GaAs 

In  our  investigation  of  the  laser-enhanced  oxidation  of 
GaAs.  we  are  concerned  with  the  optical  properties  of  GaAs 
in  the  visible  and  ultraviolet  wavelength  range.  Measure¬ 
ments  of  normal  incidence  reflectance,4  and  more  recently, 
measurements  using  spectroscopic  ellipsometry'  have  pro¬ 
vided  optical  data  for  the  intrinsic  GaAs  surface.  The  results 
for  the  wide  spectral  region  0-1 5  eV  can  be  best  summarized 
by  the  complex  dielectric  function.  In  the  visible  and  ultra¬ 
violet  region.  2-5  eV,  the  dielectric  function4-5  varies  rapidly 
and  is  sharply  structured.  Philipp  and  Ehrenreich4  com¬ 
mented  on  the  spectral  region  at  higher  energies  than  the 
intrinsic  absorption  edge  (the  band  gap).  They  noted  that 
the  sharp  structures  for  hv  between  2  and  5  eV  were  associat¬ 
ed  with  the  vertical,  valence-to-conduction-band  transitions 
in  various  parts  of  the  Brillouin  zone.  For  example.  3  eV 
photons  induce  a  transition  at  the  L-tdgt  of  the  Bnlloum 
zone.'’  i.e..  in  the  (111)  direction  (see  Fig.  1,  after  Ref.  7), 
Similarly,  5  eV  photons  induce  an  .Y-edge  transition'’  in  the 


[100]  direction.  As  a  compliment  to  the  dielectric  function, 
the  optical  coefficient  curve8  can  be  extracted  from  the  same 
measurements.  The  distinct  absorption  peaks  of  the  absorp¬ 
tion  coefficient  for  photon  energies  near  3  and  5  eV  can  be 
associated  with  the  above-mentioned  valence  to  conduction- 
band  vertical  transitions  in  the  Brillouin  zone.  An  important 
effect  is  that  going  from  the  green  to  the  deep  ultraviolet,  the 
absorption  coefficient  increases  by  a  factor  of  more  than  20. 
As  a  result,  in  the  ultraviolet  region,  the  photogeneration  is 
confined  to  within  a  few  monolayers  of  the  GaAs  surface, 
e.g.,  50  A  for  250  nm  photon  illumination. 

Photon  absorption  in  different  parts  of  the  Brillouin  zone 
results  in  the  generation  of  electrons  and  holes  of  different 
energies.  In  GaAs  crystals,  visible  light  photons,  hv~  2.5  eV, 
are  absorbed  by  the  electronic  transitions  in  the  center  of  the 
Brillouin  zone.  Inspection  of  the  energy  band  diagram.  Fig 
1,  shows  that  the  excess  energy  is  transferred  mainly  to  the 
electrons.  However,  ultraviolet  photons,  hv  —  5eV,  induce  a 
transition  at  the  A"-edge  of  the  Brillouin  zone,  and  holes  now 
receive  most  of  the  excess  photoenergy.  Specifically,  the  ul¬ 
traviolet-generated  holes  are  created  with  energies  of  —2.5 


l  r  *  r 

REDUCED  WAVE  VECTOR 


Fig  1  Electron  energy  vs  reduced  wave  vector  for  the  four  GaAs  'alence 
bands  and  the  first  several  conduction  bands,  after  Ref  ' 
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eV  below  the  absolute  top  of  the  valence  band.  Notice  that 
since  these  holes  are  produced  spatially  within  5  nm  of  the 
surface,  energy  relaxation  is  incomplete  for  carriers  reaching 
the  surface.3 

A  consequence  of  using  these  nonthermal  carriers  is  that 
the  interface  chemistry  is  altered  under  ultraviolet  illumina¬ 
tion.  In  this  work,  we  show  that  ultraviolet  light  is  sufficient¬ 
ly  actinic  that  rapid  oxidation  reactions  occur,  in  contrast  to 
visible  light,  for  which  under  the  same  conditions,  no  effect 
of  a  comparable  magnitude  could  be  observed. 

III.  EXPERIMENT 

Initial  oxidation  experiments  were  carried  out  with  a  con¬ 
tinuous-wave  (cw )  argon-ion  laser  either  tuned  to  one  of  its 
visible  or  near-ultraviolet  lines,  or  with  a  frequency-doubled 
argon-ion  laser  to  generate  257-nm  deep  ultraviolet  light.  In 
all  cases  the  laser  intensity  was  ~50  mW/cm2.  For  the  ex¬ 
periments  carried  out  in  the  ultrahigh  vacuum  (UHV)  sys¬ 
tem,  a  high-repetition  rate  excimer  laser  was  used  with  a 
wavelength  of  either  248  nm  (KrF,  1  mJ/cm2,  100  Hz)  or 
351  nm  (XeF,  0.2  mJ/cm2,  500  Hz). 

For  the  liquid-phase  oxidation  experiments,  the  sample 
was  placed  in  a  quartz  cell  containing  20  Mft  deionized  wa¬ 
ter.  Before  each  experiment,  N:  was  bubbled  through  the 
water  to  remove  any  residual  oxygen.  The  gas-phase  oxida¬ 
tion  was  performed  either  in  laboratory  air  or  in  the  prepara¬ 
tion  chamber  of  an  UHV  system  with  a  base  pressure  of  1 0  _ " 
Torr. 

The  gallium  arsenide  samples  used  in  this  experiment 
were  n-type  of  ( 100)  orientation,  doped  with  Si  to  n  =  10'" 
cm-3.  A  comparison  of  oxidation  rates  was  made  using  p- 
type  GaAs  doped  with  Zn  to  p  —  10'"  cm  “  \  Before  oxida¬ 
tion  experiments,  the  semiconductor  surface  was  cleaned 
with  trichloroethylene,  acetone,  methanol,  deionized  water, 
and  50%  NH4OH  aqueous  solution.  The  sample  was  then 
dried  with  N;.  In  addition,  samples  used  in  the  UHV  system 
were  subsequently  heated  to  540  ”C.  treated  with  mild  argon- 
ion  sputtering  ( 500  eV,  1  p A/cm2,  10  mm )  to  remove  resid¬ 
ual  oxide  and  carbon  from  the  surface,  and  then  annealed  at 
540  ‘C  for  5  min. 

The  thickness  of  the  oxide  was  determined  in  an  UHV 
multiprobe  system  described  previously.9  The  x-ray  photo¬ 
electron  spectroscopy  (XPS)  data  was  taken  using  A1  Kct 
rays  (1486.6  eV)  with  a  background  pressure  of  ~10-'° 
Torr.  An  estimate  of  oxide  thickness  is  made  by  using  the 
area  ratio  of  the  shifted  oxide  peak  and  the  substrate  peak  of 
the  same  core  level, 10  either  As  Id  or  As  2p3/2,  depending  on 
the  oxide  thickness.  The  mean  escape  depths  for  electrons  in 
GaAs  have  been  determined  to  be  25  A  for  the  A4  3 d  level, 
and  7  A  for  the  As  2p3/2  level."  This  allows  us  to  determine 
oxide  thickness  in  the  range  of  1-50  A.  Auger  electron  spec¬ 
troscopy  (AES)  depth  profiling  has  also  been  used  as  a  rela¬ 
tive  measure  of  oxide  thickness.9 

IV.  LIQUID-PHASE  OXIDATION 

There  is  a  considerable  amount  of  work  reported  in  the 
literature  concerning  reaction  products  on  GaAs  surfaces 
formed  in  an  aqueous  ambient. 1213  An  important  reason  for 


studying  the  liquid-phase  oxidation  of  GaAs  is  that  semicon¬ 
ductor  dissolution  always  involves  an  oxidation  state 
change. 14  Typically,  the  oxide  growth  is  controlled  by  apply¬ 
ing  an  external  bias,  i.e.,  anodic  oxide  growth. 

In  our  experiments,  however,  the  room-temperature  ox¬ 
ide  growth  in  deionized  water  was  obtained  by  illuminating 
the  semiconductor  surface  with  cw  laser  sources  radiating  at 
different  wavelengths.  The  measured  oxidation  rates  for 
GaAs  have  been  shown  to  exhibit  a  strong  wavelength  de¬ 
pendence.  The  oxidation  is  much  more  rapid  in  deep-ultra- 
violet  than  in  near-ultraviolet  or  visible  wavelength  regions. 
Detailed  x-ray  photoelectron  spectroscopy  (XPS)  and  Au¬ 
ger  analysis  of  the  oxide  layer  produced  in  our  liquid-phase 
experiments  has  been  reported  elsewhere,9  thus,  only  a  brief 
description  is  given  here.  Independent  of  the  laser  wave¬ 
length  used,  compositional  analysis  of  the  oxides  showed  the 
almost  complete  depletion  of  As.  The  grown  overlayer  was  a 
Ga-rich  oxide.  This  is  consistent  with  Pourbaix  diagrams'5 
on  Ga  and  As,  i.e.,  with  the  known  relative  solubility  of  Ga 
and  As  oxides.  Deionized  water  can  oxidize  GaAs,  but  it  will 
also  attack  the  photon-enhanced  oxide  to  dissolve  some  of 
the  oxidation  reaction  products.  For  example,  As203  being 
highly  soluble  in  aqueous  solutions  of  all  pH  values,  will 
dissolve  in  water  at  pH  =  7.  Ga203,  however,  is  insoluble  in 
water  and  thus  remains  on  the  surface.  This  known  depend¬ 
ence  of  the  solubilities  of  As203  and  Ga.O,  on  solution  pH 
accounts  for  the  chemical  composition  of  the  grown  over¬ 
layers.  Finally,  it  is  important  to  point  out  that  the  nonstoi¬ 
chiometry  of  oxides  formed  in  a  liquid-phase  has  been  used 
to  alter  the  formation  of  metal-semiconductor  interfaces1'' 
and  is  therefore  a  subject  of  on-going  interest. 

Due  to  the  thickness  of  the  oxide  layer  grown  in  the  liquid 
phase,  AES  depth  profiles  were  used  to  determine  the  wave¬ 
length  dependence  of  the  oxide  growth  in  water.  Visible  light 
illumination  did  not  produce  a  significant  change  in  the  oxi¬ 
dation  kinetics,  i.e.,  the  measured  oxide  thickness  was  com¬ 
parable  with  a  typical  native  oxide  thickness  value.  This  be¬ 
havior  contrasts  sharply  with  that  observed  for  irradiation 
with  ultraviolet  light  In  the  latter  case,  the  measured  oxide 
thickness  was  much  larger.  For  example,  Fig.  2  shows  the 
data  for  a  250-A-thick  oxide,  grown  after  20-mm  illumina¬ 
tion  with  257-nm-laser  light  in  deionized  water.  The  corre¬ 
sponding  thickness  for  an  oxide  layer  grown  in  5 1 4  nm  light 
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Fig  2  AES  depth  profile  of  a  GaAs  sample  immersed  in  water  and  exposed 
to  257  nm  laser  irradiation 
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and  that  grown  without  laser  irradiation  was  80  and  70  A, 
respectively. 

The  observed  rapid,  ultraviolet-induced  oxidation  in 
deionized  water  requires  injection  of  holes  from  GaAs  to  the 
H20/02  redox  level. 14  This  redox  level  is  centered  around 
0.4  eV  below  the  top  of  the  GaAs-valence  band. 17  Therma- 
lized  holes  in  the  valence  band  are  not  able  to  access  this 
redox  level  efficiently  and  activate  the  rapid  reaction;  thus 
hot,  nonthermalized  holes  are  required  in  GaAs-water  in¬ 
terfaces  to  obtain  the  observed  reaction  rates.  Some  previous 
experiments  have  shown  that  hot  electrons  are  active  in  pro¬ 
moting  surface  chemical  reactions.1*  More  recently,  Bou¬ 
dreaux  et  al. 19  have  discussed  the  possibility  that  holes  or 
electrons  reach  the  semiconductor-surface  interface  as  hot 
carriers  and  can  in  this  way  initiate  reactions  that  would 
need  more  energy  than  that  available  at  the  band  edge.  It  is 
reasonable,  therefore,  that  the  hot  holes  excited  in  GaAs 
cause  the  oxidation  reaction  observed  in  liquid-phase  oxida¬ 
tion. 

V.  GAS-PHASE  OXIDATION 

Wavelength-dependent  enhancement  of  the  GaAs  oxida¬ 
tion  reaction  has  also  been  observed  in  a  series  of  experi¬ 
ments  performed  at  room  temperature  in  gaseous  ambients, 
like  the  liquid-phase  oxidation,  the  gas-phase  oxidation  is 
found  to  be  more  strongly  enhanced  by  ultraviolet  light  than 
by  visible  light.  In  the  gas  phase  experiments  performed  in 
laboratory  air,  the  samples  were  chemically  cleaned,  irra¬ 
diated  with  either  cw  or  pulsed  sources,  and  inserted  into  the 
UHV  system  for  analysis.  The  primary  method  of  determin¬ 
ing  the  thickness  of  the  grown  oxide  was  the  comparison  of 
XPS  peak  areas.  In  contrast  to  liquid-phase  oxidation,  XPS 
showed  that  the  oxide  layer  grown  in  gas  phase  was  stoichio¬ 
metric  with  regard  to  arsenic  and  gallium,  as  has  been  gener¬ 
ally  reported.2'*'72  The  peak  area  ratio  of  the  As  2 p,l2  to  the 
Ga2 pJ/2  for  a  clean  surface  was  —1.4.  The  ratio  for  the 
oxide  layer  was  also  found  to  be  —  1.4.  This  ratio  has  been 
found  to  correspond  to  the  photoemission  of  a  stoichiome¬ 
tric  GaAs  surface  grown  by  MBE.23 

A  comparison  was  made  of  n-type  GaAs  oxidation  and  p- 
type  GaAs  oxidation  under  similar  conditions  with  different 
wavelength  irradiation  in  air.  A  typical  result  is  when  257- 
nm  cw  laser  irradiation  was  used  for  20  min  in  moist  air.  The 
oxide  thickness  on  n-type  substrate  and  p-type  substrate  was 
21  and  20  A,  respectively.  Our  results  agree  with  previously 
reported  studies  where  no  difference  was  seen  between  the  p- 
type  and  the  n-type  oxidation  rates.20"22  However,  previous 
studies  have  not  reported  the  effect  of  deep-ultraviolet  light 
on  the  oxidation  of  GaAs. 

The  relation  of  wavelength  of  irradiation  to  oxide  thick¬ 
ness  for  fixed  exposure  time  was  initially  investigated  in  lab¬ 
oratory  air.  A  summary  of  results  is  shown  in  Fig.  3,  where 
weak  enhancement  is  seen  for  wavelengths  of  351  nm  and 
greater,  while  a  strong  enhancement  is  seen  for  the  deep  ul¬ 
traviolet  wavelengths  of  257  and  248  nm.  The  open  circle 
shown  at  zero  photon  energy  is  the  oxide  thickness  from  a 
control  sample.  This  sample  was  cleaned  and  handled  in  a 
manner  similar  to  the  other  points  in  Fig.  3,  but  it  was  not 
exposed  to  laser  radiation.  It  shows  the  thickness  of  a  typical 
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Fig.  3.  GiAs-oxide-Uyer  thickness  determined  by  XPS  for  different  wave¬ 
length  laser  irradiation  at  50  mW/cm:.  Each  irradiation  was  performed  in 
laboratory  air  for  20  minutes. 


residual  oxide  formed  during  sample  handling  in  air.  The 
enhanced  oxidation  rates  can  be  clearly  seen  on  top  of  this 
residual  thickness.  It  is  interesting  to  note  that  similar  results 
were  found  regardless  of  whether  the  source  was  cw  or 
pulsed.  In  either  case,  the  laser  intensities  were  low  so  as  not 
to  cause  thermal  oxidation. 

The  wavelength  dependence  of  the  oxidation  of  GaAs  was 
confirmed  by  a  comparison  of  oxides  grown  on  clean  sur¬ 
faces  in  controlled  ambients.  A  chemically  cleaned  GaAs 
sample  was  loaded  into  the  UHV  system  and  then  sputtered 
and  annealed  as  described.  XPS  was  used  to  confirm  that  no 
oxide  was  on  the  surface  within  detection  limits.  The  clean 
sample  was  transferred  to  a  preparation  chamber  into  which 
gases  were  controllably  leaked.  Laser  light  was  directed  at 
the  sample  surface  through  a  Suprasil  window  on  the  prep¬ 
aration  chamber,  and  the  sample  was  then  transferred  for 
analysis  without  exposure  to  laboratory  air.  Using  an  envi¬ 
ronment  of  10-Torr  water  vapor  and  10-Torr  O,  in  the  prep¬ 
aration  chamber,  20-min  irradiation  with  an  excimer  laser  at 
351  nm  (XeF)  produced  only  1.1  A  of  oxide.  Under  similar 
conditions,  with  248  nm  (KrF)  illumination,  an  oxide  layer 
of  8. 1  A  formed.  The  relative  difference  in  the  shifted  oxide 
peak  can  clearly  be  seen  in  the  XPS  spectra  of  Fig.  4  where  a 
strong  wavelength  dependence  for  the  oxidation  of  GaAs  is 
confirmed. 

To  understand  the  growth  characteristics  of  the  GaAs  ox¬ 
ide,  additional  experiments  were  performed  in  the  well-con¬ 
trolled  environment  of  the  UHV  system.  The  ambient  in  the 
preparation  chamber  was  varied  from  lOTorrto  1  atmofO-, 
and  from  0  to  10  Torr  of  water  vapor.  A  typical  exposure 
time  was  20  min.  When  the  GaAs  sample  was  exposed  to 
moist  O,  for  20  min  without  laser  irradiation,  no  oxidation 
was  detected.  Also,  when  the  sample  was  irradiated  in  10 
Torr  of  water  vapor  with  no  0:  introduced,  no  oxide  was 
detected. 

A  comparison  was  made  of  the  oxidation  rate  in  a  moi' 
O,  ambient  and  a  dry  0;  ambient.  The  moist  ambient  con¬ 
sisted  of  a  mixture  of  10-Torr  O,  and  10-Torr  water  vapor 
The  dry  ambient  was  10-Torr  O,  only.  As  shown  in  Fig.  5. 
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BINDING  ENERGY  <«V) 

Fig  4  X-ray  photoelectron  spectra  of  As  2/?,  .  and  Oa  2p ,  .  peaks  of  the 
clean  and  laser-irradiated  samples.  Laser  exposure  was  for  20  min  in  a  10 
Torr  O./lOTorr  H.O  ambient 


for  oxide  layers  up  to  ~  5  A,  there  is  no  detectable  difference 
in  the  oxidation  rate  by  illumination  in  moist  O,  and  dry  02. 
However,  for  thicker  oxides,  those  irradiated  in  a  moist  O, 
ambient  grow  much  faster  than  those  irradiated  in  dry  02. 
We  believe  that  enhancement  due  to  the  moist  environment 
is  similar  to  the  enhancement  of  water  vapor  in  silicon  oxida¬ 
tion  where  hydroxyl  groups  are  incorporated  in  the  oxide 
layer.  This  oxide  is  more  porous  than  the  oxide  grown  in  dry 
O,,  allowing  more  rapid  transport  of  reactants  through  the 
oxide  layer. :4  Further  experiments  are  needed  to  determine 
which  reactants  are  being  transported  through  the  oxide  and 
to  confirm  that  the  hydroxyl  groups  are  incorporated 
Another  series  of  observations  is  shown  in  Fig.  6.  Here  we 
have  measured  the  O-  exposure  to  the  GaAs  surface  in  terms 
of  Langmuir,  determined  by  the  oxygen  pressure  and  expo¬ 
sure  time.  The  time  of  the  oxygen  exposure  also  equals  the 
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EXPOSURE  TIME  (m*n) 

FfC.  5.  GaAs-oxide  layer  thickness  as  a  function  of  laser  exposure  time  The 
results  are  compared  for  dry  and  moist  0:  ambient 
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Fig.  6.  GaAs-oxide  layer  thickness  as  a  function  of  0:  exposure  ( 1 
L*  10" 6  Torr  s).  The  results  are  compared  for  cases  when  either  the  Or 
pressure  or  the  exposure  time  was  held  constant. 

time  of  the  laser  exposure.  The  figure  shows  very  different 
oxide  thicknesses  for  the  same  02  exposure  produced 
through  different  means.  When  the  exposure  time  is  held 
constant,  and  the  02  pressure  is  increased,  the  oxide  thick¬ 
ness  only  slowly  increases.  On  the  other  hand,  when  the  O- 
pressure  is  held  constant,  while  the  exposure  time  is  in¬ 
creased,  the  oxide  thickness  increases  much  more  rapidly.  It 
has  been  suggested  that  a  more  appropriate  measure  of  expo¬ 
sure  would  include  a  dependence  on  the  number  of  carriers 
generated  at  the  surface.2 ' 

VI.  DISCUSSION  AND  CONCLUSIONS 

Several  groups  have  postulated  that  photogeneration  of 
carriers  is  responsible  for  the  enhanced  oxidation  of  GaAs 
under  illumination,2125  26  but  a  detailed  mechanism  of  the 
role  of  carriers  is  not  yet  developed.  For  gas-phase  oxidation 
it  has  been  suggested  that  the  attachment  of  hot  electrons  to 
physisorbed  0:  molecules,  with  the  resultant  lowering  of  dis¬ 
sociation  energy,  is  responsible  for  the  faster  oxidation 
rate  27  A  similar  effect  has  been  claimed  for  the  photoen- 
hanced  oxidation  of  silicon.2*  Another  effect  that  has  been 
suggested  is  the  surface  recombination  of  photogenerated 
electron-hole  pairs  which  provide  energy  to  accelerate  the 
oxidation  reaction  22  However,  the  strong  wavelength  de¬ 
pendence  of  the  photoenhancement  of  GaAs  oxidation,  in 
either  gas  or  liquid-phase  experiments  has  not  been  pre¬ 
viously  reported. 

In  order  to  attribute  the  wavelength  dependence  of  oxide 
growth  to  the  effect  of  photogenerated  earners,  we  need  to 
eliminate  several  other  possible  effects  of  illumination  that 
are  wavelength  dependent.  One  effect  is  the  direct  photon 
excitation  of  0:  molecules  before  they  reach  the  surface  The 
dissociation  limit  for  O-  is  known  to  be  5.1  eV  2V  Except 
where  noted,  this  energy  is  greater  than  photon  energies  used 
in  our  experiments.  In  order  to  check  that  O-  was  not  being 
excited  by  radiation  in  the  gas  phase,  a  cylindncal  lens  ( 1  m 
focal  length)  was  used  to  focus  a  248-nm  ( KrF)-excimer 
beam  parallel  to  the  surface  of  a  clean-GaAs  sample  in  a 
typical  environment  in  the  preparation  chamber.  The  energy 
of  these  photons  is  4.9b  eV.  For  a  20  min  exposure,  virtually 
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no  oxide  was  formed.  This  is  compared  to  8  A  of  oxide  for  a 
broad  beam  of 248-nm  illumination  normal  to  the  surface.  In 
addition,  experiments  were  performed  with  193  nm  (ArF) 
laser  light.  The  photon  energy,  h v  —  6.4  eV,  is  now  above  the 
O,  dissociation  limit.29  Consequently,  a  parallel  illumina¬ 
tion  grew  25  A  of  oxide,  due  to  the  known  effect  of  ozone 
oxidation.10  Direct  beam  illumination  under  similar  condi¬ 
tions  produced  more  than  70  A  of  oxide  (our  XPS  thickness 
limit  due  to  the  escape  depth  of  the  As  3 d  peak ) . 

The  effect  of  temperature  rise  can  also  be  eliminated  as  a 
cause  of  oxidation.  We  confirmed  experimentally  that  the 
temperature  required  for  our  typical  oxide  growth  rate  was 
—450  *C.  Calculations  of  the  temperature  rise  of  our  GaAs 
surfaces  due  to  the  very  low  intensity  radiation,  typically  1 
mJ/cm2,  give  an  upper  limit  of  a  3  *C  transient  temperature 
rise.11  Furthermore,  due  to  similar  absorption  depths  of  35 1 
and  248-nm-laser  radiation  in  GaAs,  equivalent  intensities 
of  illumination  would  produce  a  nearly  identical  surface 
temperature  rise  in  each  case.  However,  we  have  seen  that 
for  equivalent  exposures,  the  248  nm  light  enhances  the  oxi¬ 
dation  much  more  than  the  35 1  nm  light.  Therefore,  thermal 
effects  can  be  ruled  out  as  the  cause  of  the  wavelength  de¬ 
pendence  of  the  photoenhanced  oxidation  of  GaAs.  We  thus 
attribute  the  wavelength  dependence  of  the  enhancement  of 
GaAs  oxidation  to  the  wavelength  dependence  of  the  photo¬ 
generation  of  carriers. 

At  this  time,  the  exact  role  of  electrons  and  holes  in  the 
oxidation  of  GaAs  is  not  known.  It  has  been  suggested  that  a 
possible  role  of  the  photogenerated  electrons  is  that  a  che¬ 
misorbed  oxygen  molecule  takes  up  an  electron,  enhancing 
the  breakup  of  the  molecule.  It  has  also  been  suggested  that 
photogenerated  holes  weaken  the  bonds  between  the  sub¬ 
strate  atoms  at  the  surface.  These  effects  would  stimulate  the 
uptake  of  oxygen.21 

In  conclusion,  it  is  important  to  note  that  the  energy  of  the 
photogenerated  electron  is  nearly  constant  over  the  wave¬ 
lengths  used  here  (see  Fig.  1 )  making  it  unlikely  that  elec¬ 
trons  are  responsible  for  the  wavelength  dependent  enhance¬ 
ment  mechanism.  In  contrast,  the  energy  of  the 
phctogenerated  hole  varies  greatly  over  these  wavelengths, 
indicating  that  hot  holes  may  be  responsible  for  the  wave- 
length-dependent  enhancement  of  the  oxidation  of  GaAs  ob¬ 
served.1 
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ABSTRACT 

Low-power  ultraviolet  (350  nm)  laser  light  has  been  used  to 
inhibit  polymer  formation  on  silicon  in  a  plasma  reactor  containing 
CF4/H2  reactants.  The  resultant  increase  in  etch  rate  has  been 
studied  as  a  function  of  hydrogen  partial  pressure.  Auger  electron 
spectroscopy  reveals  a  reduction  in  polymer  formation  in  the  area 
illuminated  by  the  laser. 
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In  previous  work1'2,  we  have  described  some  of  the  effects  of 
cw  laser  radiation  focused  onto  the  surface  of  silicon  in  a  plasma 
reactor.  Briefly,  it  was  found  that  when  laser  light  illuminated  the 
surface  of  silicon  in  a  CF4/O2  plasma,  the  etch  rate  was  locally 
enhanced.  At  high  laser  powers,  the  enhancement  was  primarily  due  to 
thermal  effects.  At  low  laser  powers,  the  effect  was  primarily 
nonthermal  and  was  dependent  on  the  doping  level  of  the  substrate  and 
the  wavelength  of  the  laser  light. 

We  have  now  shown  that  addition  of  a  polymerizing  agent  to  the 
discharge  increases  both  the  relative  and  absolute  magnitudes  of  the 
laser  enhancement.  The  laser  light  controls  the  etch  rate  by 
reducing  polymer  deposition  on  silicon.  This  laser-induced 
desorption  may  be  used  to  change  both  the  etch  rate  and  the  etch 
profile  in  a  plasma  reactor.  The  contrast  provided  under 
polymerizing  conditions  with  low-intensity  laser  illumination  greatly 
exceeds  that  obtained  with  a  CF4/O2  plasma.  The  mechanism  here, 
as  in  previous  work1' 2 f  is  attributable  to  the  production  of 
electron-hole  pairs. 

The  experimental  arrangement  is  similar  to  that  described  in  the 
previous  work1'2.  The  plasma  etcher  was  operated  with  40  mTorr  of 
CF4  (flow  rate  =  37  seem)  and  with  0  to  30  mTorr  of  H2.  The 
radio  frequency  (rf)  power  was  30  watts  --  corresponding  to  an  rf 
power  density  of  0.05  W/cm2.  The  laser  source  was  a  krypton-ion 
laser,  line  tunable  to  various  wavelengths  between  350  nm  and  800 
nm.  The  majority  of  the  work  reported  in  this  paper  was  done  with 


the  multiline  near-uv  output,  350-357 


In  addition,  in  the 
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present  experiment,  a  differentially  pumped  mass  spectrometer  with  a 
high-pressure  sampling  head  is  attached  to  the  plasma  etcher 
chamber.  With  the  plasma  etcher  at  a  pressure  of  approximately  100 
mTorr,  the  mass  spectrometer,  pumped  by  a  50-1/sec  turbomolecular 
pump,  has  a  base  pressure  of  3x10“^  Torr. 

Typically,  an  unmasked  sample  of  (100)  silicon  was  clamped  to  the 


water-cooled  baseplate  --  previous  work  showed  this  to  be  an  adequate 
method  of  ensuring  good  thermal  contact  to  the  baseplate.  The  sample 
size  and  doping  were  1  cm2  and  n-type  (1  Q-cm) ,  respectively. 
Sample  masking  allowed  the  etch  or  deposition  depth  to  be  measured 
after  the  run  with  a  surface  prof ilometer .  For  some  of  the  runs,  an 
aluminum-masked  piece  of  silicon  was  used  to  check  the  dark  etch  (or 
deposition)  rate. 

Figure  1  shows  the  change  in  the  dark  etch  rate  of  the  silicon, 
as  the  partial  pressure  of  H2 .  added  to  the  40  mTorr  CF4,  is 
increased  from  0  to  30  mTorr  (this  change  corresponds  to  0  to  43%  of 
the  total  pressure,  respectively).  The  dark  etch  rate  drops  an  order 


of  magnitude  over  this  range  of  H2  concentration. 


:f  the  Hr 


concentration  is  raised  above  -  30  mTorr,  deposition  of  a  polymer 
layer,  instead  of  etching,  occurs. 

The  differentially  pumped,  residual  gas  analyzer  was  used  to  look 
at  changes  in  gas-phase  components,  as  a  function  of  H2  partial 
pressure.  It  was  not  possible  to  monitor  fluorine  concentration 
directly,  since  fluorine  atoms  are  very  reactive  and  very  few  pass 
through  the  orifice  into  the  residual  gas  analyzer  chamber  without 
reacting  via  wall  or  gas-phase  collisions.  However,  it  was  possible 
to  monitor  HF  concentration  instead. 
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Figure  2  illustrates  the  variation  in  HF  signal  as  a  function  of 
H2  partial  pressure.  The  curve  indicates  that  the  production  of  HF 
peaks  at  about  20-mTorr  H2  partial  pressure,  then  declines.  Since 
HF  production  is  an  important  mechanism  for  scavenging  fluorine  from 
the  gas,  this  implies  that  as  H2  partial  pressure  is  increased,  the 
fluorine  concentration  falls,  then  rises  again. 

Samples  coated  with  a  brownish  polymer  film  during  processing 
were  removed  from  the  reactor  and  examined  with  x-ray  photoelectron 
spectroscopy  (XPS)  under  ultrahigh  vacuum  conditions.  Figure  3  shows 
the  XPS  spectrum  of  the  coated  surface.  The  peak  corresponding  to 
the  C  Is  level  appears  to  have  a  multiplet  structure,  with  five 
partially  resolved  components  indicated  by  dashed  lines.  From  the 
high  binding  energy  side,  the  components  represent,  in  succession, 
trifluorocarbon  (292.9  eV),  difluorocarbon  (290.9  eV), 
monofluorocarbon  (287.9  eV)  alkyloxyl  carbon  (286.6  eV),  and 
alkyl/graphitic  carbon  (284.6  eV).3  The  comparable  intensities  of 
all  components  suggest  an  extensively  fluorinated  structure  for  the 
polymer  film. 

In  the  Si  2p  level,  the  small  unresolved  broader  peak  (binding 
energy  at  102  -  103  eV)  corresponds  to  silicon  oxides  and  possibly 
some  fluorides  as  well,  although  polymer  deposition  was  the 
predominate  process  for  these  samples.  The  oxides  were  probably 
formed  in  air  before  the  samples  were  placed  in  the  surface  analysis 
vacuum  chamber.  The  larger  peak  at  99.3  eV  originates  from  the 
silicon  substrate. 

The  results  obtained  in  this  set  of  experiments  are  consistent 
with  a  process  in  which  the  silicon  etch  rate  is  limited  at  lower 


H2  concentrations  by  lack  of  gas-phase  fluorine  atoms,  and  at 
higher  H2  concentrations  by  the  formation  of  a  polymer  layer  on  the 
silicon  surface,  which  can  block  reaction  sites.  The  work  done  by 
several  other  researchers  on  the  role  of  hydrogen  in  CF4/H2 
plasmas  supports  this  explanation.  In  particular,  although  the  role 
of  H2  in  polymerization  is  very  dependent  on  system  parameters  such 
as  total  pressure,  flow  rate,  electrode  spacing,  and  RF  power,  it  has 
been  generally  found4' 6,7  that  adding  H2  to  a  CF4  plasma 
reduces  the  silicon  etch  rate  through  a  combination  of  scavenging 
fluorine  from  the  gas  and  forming  polymer  on  the  silicon  surface. 

When  unmasked  silicon  samples  were  illuminated  with  a 
weakly-focused  350-nm  laser  light,  an  enhanced  local  etch  rate  was 
observed.  The  laser  spot  size  on  the  sample  was  typically  -1/4  mm 
in  diameter.  By  patterning  the  laser  beam  so  as  to  have  light  and 
dark  regions  on  the  substrate,  the  ratio  of  the  laser-enhanced  etch 
rate  to  the  dark  etch  rate  could  be  measured  as  a  function  of  H2 
partial  pressure.  These  data  are  displayed  in  Fig.  4.  In  this 
experiment,  the  laser  power  density  was  approximately  80  W/cm2  in 
the  illuminated  areas.  From  0  to  10  mTorr  of  H2 ,  the  enhancement 
ratio  increased  slightly.  However,  at  about  15-mTorr  of  H2 
pressure,  there  was  a  large,  abrupt  rise  in  the  relative 
light-enhanced  etch  rate.  After  this  point,  the  etch  rate  again  rose 
slightly  with  increasing  H2  pressure.  The  laser  has  its  greatest 
affect  at  the  higher  H2  concentrations,  where  it  is  known  from 
previous  studies  that  there  is  an  adequate  supply  of  fluorine  in  the 
gas  to  allow  more  rapid  etching  to  occur  if  the  surface  is  not 
blocked  by  polymer  deposits. 

When  the  unfocused  350-nm  laser  output,  at  an  intensity  of 


approximately  5  w/cm2,  was  used  to  illuminate  the  surface  of  the 
unmasked  silicon  sample,  formation  of  the  polymer  layer  was  inhibited 
at  the  point  of  impingement  of  the  laser  beam.  When  the  laser  beam 
was  directed  on  the  sample,  an  area  that  appeared  visually  clear  of 
polymer  film  formation  resulted.  Auger  electron  spectra  were  taken 
both  inside  and  outside  of  the  illuminated  region.  Results  show  a 
reduction  of  at  least  fifty  percent  in  carbon,  oxygen,  and  fluorine 
within  the  illuminated  region.  This  indicates  that  the  laser 
exposure  substantially  reduces  the  amount  of  polymer  formation  on  the 
silicon  surface;  this  reduction  allows  the  etch  process  to  proceed  at 
a  more  rapid  rate. 

In  our  previous  experiments1'2,  it  was  found  that 
laser-enhancement  of  the  etch  process  depended  on  the  wavelength  of 
the  laser  source.  This  effect  was  interpreted  to  be  a  manifestation 
of  photogenerated  carrier-induced  surface  chemistry,  with  this  in 
mind,  the  experiment  of  Figure  4  was  repeated  using  the  647-nm 
output  from  the  krypton-ion  laser,  with  approximately  the  same  power 


density. 


It  was  found  that  the  slow  rise  in  the  ratio  of 


light-to-dark  etch  rates  occurs  again.  However,  the  sharp  increase 
at  15-mTorr  partial  pressure  H2  was  missing. 

The  above  result  may  be  understood  on  the  basis  of  a  mechanism 
based  on  carrier-induced  desorption.  It  has  been  previously  found  in 
experiments  involving  both  wet  and  dry  laser-induced  chemistry  that 
carrier  production  with  UV  light  results  in  an  increase  in 
desorption,  or  other  surface  reaction  rates.  This  effect  results 
from  the  fact  that  UV  light  is  absorbed  sufficiently,  -100A,  close 
to  the  solid  surface  that  bulk  recombination  effects  do  not  reduce 
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the  photogenerated  carrier  concentration  at  the  surface8.  On  the 
other  hand,  for  647-nm  light,  the  absorption  depth  is  -3um,  and 
the  carrier  flux  to  the  surface  is  greatly  reduced.  Note  that  for 
both  uv  and  red  light  with  the  given  spot  size,  the  sample  heating  at 
the  surface  will  be  essentially  the  same,  although  negligible,  at 
these  power  densities.  The  absence  of  the  sharp  etch  rate 
enhancement  at  15  mTorr  partial  pressure  when  647-nm  light  is  used 
may  therefore  be  explained  on  the  basis  of  carrier  induced 
desorption. 

Two  potential  applications  of  this  laser-induced  process  would  be 
to  increase  the  anisotropy  of  silicon  etched  in  a  plasma  reactor, 
without  increasing  the  degree  of  ion  bombardment,  and  to  permit 
maskless  etching  using  a  projected,  patterned  laser  beam.  With 
regard  to  the  first  application,  since  the  laser  light  has  been  shown 
to  increase  the  etch  rate  in  this  system,  it  might  be  expected  to 
change  the  directionality  of  the  etch  as  well.  To  examine  this 
possibility,  silicon  masked  with  a  10-um  bar  pattern  was 

illuminated  with  a  weakly  focused  (-lOOum)  laser  beam,  with  15 
mTorr  of  H2  added  to  40  mTorr  of  CF4  in  the  plasma  reactor. 
Profiles  were  analyzed  using  an  SEM.  Initial  results  indicate  that 
the  light  yields  a  profile  which  is  not  only  deeper,  but  also  has  a 
much  cleaner  wall  and  bottom  structure  than  in  the  unilluminated 
case. 

In  conclusion,  we  have  demonstrated  that  low  intensity,  cw,  uv 
light  at  350  nm  can  be  used  to  inhibit  polymer  formation  on  silicon 
in  a  plasma  reactor  operated  with  CF4  and  H2.  The  dependence  of 

the  laser-enhanced  etch  rate,  as  a  function  of  H2  partial  pressure, 


is  due  to  variations  in  both  gas-phase  fluorine  concentration  and 
polymer  formation  on  the  silicon  surface.  Work  is  currently  underway 
to  try  to  use  this  effect  to  obtain  improved  anisotropy  in  the  plasma 
reactor,  without  the  damage  associated  with  increased  ion  flux  and 
energy. 
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Plasma  etch  rate  of  silicon  vs.  partial  pressure  of 
hydrogen  added  to  40  mTorr  of  CF4. 


Figure  2: 


HF  (Mass  20)  signal  from  differentially  pumped  mass 
spectrometer  attached  to  etching  chamber. 
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Figure  3 


X-ray  photoelectron  spectra  of  silicon  surface  with 
deposited  polymer  film. 


Figure  4: 


Laser-induced  etch  enhancement  vs.  partial  pressure  of 
hydrogen  added  to  40  mTorr  CF a. 
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Bombardment  of  silicon  (100)  surfaces  at  room  temperature  by  an  oxygen-containing  low-energy 
ion  beam  is  studied  as  an  alternative  to  thermal  oxidation  to  produce  ultrathin  oxide  films.  A  self- 
limiting  oxide  thickness  of  about  50  A  is  obtained  by  using  ions  with  energy  100  eV  or  lower. 

Auger  electron  spectroscopy  depth  profiles  of  an  ion-beam  grown  oxide  and  a  thermally  grown 
oxide  show  very  similar  composition.  Grazing  angle  x-ray  photoelectron  spectroscopy  indicates 
the  presence  of  lower  oxides  of  silicon  near  the  surface.  The  capacitance-voltage  characteristics  of 
ion-beam  grown  oxides  compare  favorably  with  those  of  thermally  grown  oxides. 


I.  INTRODUCTION 

Ultrathin  gate  oxide  films  are  expected  to  gain  increasing 
importance  as  metal-oxide  semiconductor  ( MOS )  devices 
are  scaled  to  smaller  dimensions.  At  the  same  time,  low- 
temperature  processes  are  already  replacing  many  thermal 
fabrication  steps  in  an  attempt  to  minimize  damage  to  the 
substrate.  Many  processes  have  been  proposed  as  low-tem¬ 
perature  alternatives  to  thermal  oxidation  of  silicon,  the 
most  notable  being  plasma  oxidation  and  chemical  vapor 
deposition  (CVD).'~3  These  methods,  however,  have  not 
yet  produced  thin  oxides  suitable  for  MOS  applications.  We 
have  studied  the  bombardment  of  ( 100)  silicon  surfaces  by 
oxygen-containing  ion  beams  and  have  successfully  grown 
ultrathin  device-quality  oxides.4 

The  composition  of  the  obtained  ultrathin  oxides  is  ana¬ 
lyzed  by  Auger  electron  spectroscopy  and  x-ray  photoelec¬ 
tron  spectroscopy.  The  oxides  are  also  characterized  electri¬ 
cally  by  studying  the  capacitance-voltage  (C-K)  and 
current-voltage  (I-V)  characteristics  of  aluminum-gate 
MOS  capacitors. 

II.  EXPERIMENTAL  METHOD 

Low-energy  ion-beam  oxidation  uses  a  2.5-cm-diam  sin¬ 
gle-grid  Kaufman-type  source  which  produces  a  Gaussian 
ion  beam  with  energies  up  to  100  eV.  The  maximum  beam 
current  density  at  the  target  is  150/tA/cm\  The  source  uses 
argon  and  oxygen  gases  in  varying  ratios  with  an  oxygen 
partial  pressure  in  the  low  10“4  Torr. 

Ion-beam  oxide  MOS  capacitors  are  fabricated  on  wafers 
which  have  a  5000-A  wet  oxide  grown  at  950  ’C  and  an¬ 
nealed  for  30  min  at  the  same  temperature  in  dry  N;.  The  ion 
beam  is  then  used  to  grow  ultrathin  gate  oxides  in  windows 
etched  in  the  thick  oxide.  The  wafers  are  unheated;  a  ther¬ 
mocouple  mounted  on  the  substrate  holder  indicates  typical 
temperature  rises  of  <  5  *C  above  room  temperature.  Alumi¬ 
num  is  evaporated  and  patterned  to  form  the  capacitors. 
After  metallization  the  sample  is  annealed  in  forming  gas  at 
400  *C  for  5  min. 

Control  samples  of  MOS  capacitors  with  thermally  grown 
gate  oxides  are  fabricated  according  to  the  same  procedure 
Here  the  gate  oxides  are  grown  in  dry  oxygen  at  900  ”C  and 
annealed  in  situ  in  nitrogen. 


Samples  for  surface  analysis  are  prepared  according  to  the 
same  procedure  without  the  aluminum  deposition.  Surface 
analysis  was  performed  using  a  multiprobe  surface  spectros¬ 
copy  system  equipped  with  a  concentric  hemispherical  ener¬ 
gy  analyzer.  X-ray  photoelectron  spectroscopy  (XPS)  mea¬ 
surements  were  carried  out  with  the  Mg  Ka  line  ( 1253.6 eV) 
at  240  W.  The  system  was  calibrated  from  the  Au  4/7(2  level 
with  a  binding  energy  of  83.8  eV.  C  lr  from  residual  carbon 
on  the  surface  was  verified  to  have  binding  energy  284.6  eV 
to  eliminate  any  charging  effects.  Auger  electron  spectra 
were  taken  with  a  3-keV  electron  beam  with  a  1  -mm  spot  size 
on  the  surface.  A  differentially  pumped  ion  gun  was  used  to 
generate  3-keV  argon  ions  to  sputter  the  surfaces  The  typi¬ 
cal  ion  current  density  used  was  1.2  /xA/cm\  The  system 
pressure  was  kept  below  5x10“'°  Torr  for  XPS  and 
5  X  10“ *  Torr  for  Auger  electron  spectroscopy  ( AES )  and 
maintained  at  2x  10“ 7  Toit  in  argon  while  sputtering 

III.  RESULTS  AND  DISCUSSION 

Auger  electron  spectroscopy  in  conjunction  with  argon 
ion  sputtering  is  used  to  study  the  depth  profile  of  the  oxide 
layer.  As  shown  in  Fig.  1,  theO  (KLL)  and  Si  (LVV)  Auger 
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J.  Vac.  Set.  Techno).  A  5  (4),  Jut/ Aug  1W7  0734-210 1/»7/0415«»-03*01. 00 


c  1M7  American  Vacuum  Society 


1570 


Yu,  Todorov,  and  Fossum:  Characterization  of  ultrathin  SIO,  fllma 


1570 


peak-to-peak  signals  are  plotted  as  a  function  of  sputtering 
time  for  both  an  ion-beam  grown  oxide  and  a  thermally 
grown  oxide  about  50  A  thick  as  measured  by  ellipsometry. 
For  the  Si  ( L  VV)  emission  only  the  signal  from  the  substrate 
is  shown  because  of  the  greater  sensitivity.  The  close  resem¬ 
blance  of  the  two  depth  profiles  indicates  that  the  ion-beam 
oxide  is  of  comparable  composition  to  the  thermally  grown 
oxide,  except  that  the  latter  has  a  somewhat  sharper  inter¬ 
face.  Because  of  the  limited  depth  resolution  the  actual  inter¬ 
face  for  both  samples  is  much  sharper  than  it  appears. 

Angle-dependent  x-ray  photoelectron  spectra  of  the  ion- 
beam  grown  oxide  are  shown  in  Fig.  2  for  increasing  detector 
angles.  6  is  the  angle  between  the  detector  axis  and  the  sur¬ 
face  normal.  At  low  detector  angles  the  Si  2 p  (Si02 )  signal  is 
peaked  at  102.9  eV,  a  typical  value  observed  for  very  thin 
Si02.5  However,  at  9  =  80“  where  the  electron  escape  depth 
is  about  5  A,  the  peak  position  is  shifted  to  102.1  eV.  This 
indicates  the  presence  of  lower  oxides  in  the  top  few  layers 
due  either  to  the  preferential  sputtering  or  to  the  low  mobil¬ 
ity  of  the  oxygen  atoms  incorporated  in  the  film.  The  O  Is 
peak  has  an  unresolved  component  which  grows  in  relative 
proportion  toward  higher  detector  angle  and  is  shifted  to¬ 
ward  lower  binding  energy  by  about  1.2  eV.  It  has  been 
shown  that  the  O  Is  peak  due  to  lower  oxides  in  ultrathin 
thermally  grown  SiO:  does  not  shift  more  than  0.3  eV.6 
Therefore,  this  component  is  probably  associated  with  the 
radiation  damage  caused  by  the  ion  bombardment.  Large 
sample  size  was  used  to  avoid  the  projection  loss  at  grazing 
angle.  The  large  decrease  of  intensity  at  d  =  80°  is  probably 
due  to  misalignment  of  the  detector  axis  and  the  sample  rota¬ 
tion  axis.  Both  theO  Is  and  Si  2p(SiO;)  peaks  from  the  ther¬ 
mally  grown  oxide  (not  shown  here)  appear  to  be  sharper 
than  the  ones  from  the  ion-beam  oxide  and  their  shapes  do 
not  change  at  higher  detector  angles. 

Figure  3  compares  the  C-V  and  I-V  characteristics  of  a 
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Fic.  3  Companion  of  high-frequency  capacitance-voltage  and  current- 
voltage  characteristics  for  MOS  capacitors  with  thermally  grown  (solid 
lines)  and  ion-beam  grown  (dashed  lines)  gate  oxides. 

capacitor  with  an  ion-beam  grown  oxide  and  a  thermally 
fabricated  capacitor.  The  thickness  of  the  ion-beam  oxide  is 
56  A;  the  thermal  oxide  is  52  A  thick.  The  ion-beam  oxide  is 
grown  using  a  100-eV  ion  beam  with  a  P5  ,uA/cnr  maxi¬ 
mum  current  density  at  the  target.  The  Ar:0;  ratio  in  the 
source  is  1:1.  The  sample  is  exposed  for  6  min  leading  to  a 
maximum  dose  of  2.8X  10'7  cm-2. 

The  I-V  curves  clearly  show  that  the  ion-beam  oxide 
passes  some  current — in  reverse  bias  its  leakage  current  is 
three  orders  of  magnitude  greater  than  the  thermal  oxide 
leakage  current.  This  may  be  due  to  a  combination  of  higher 
thermal  generation  rates  and  a  lower  oxide  impedance. 
Note,  however,  that  the  leakage  current  through  the  oxide 
does  not  hinder  the  proper  functioning  of  MOS  transistors 
fabricated  with  ion-beam  gate  oxides.7  The  leakage  current 
is  more  than  three  orders  of  magnitude  smaller  than  the 
drain-source  saturation  current. 

A  comparison  of  the  C-V  curves,  however,  is  much  more 
favorable.  Note  that  the  characteristics  are  similarly  steep  in 
depletion  and  in  the  sharp  onset  of  inversion.  The  small  hys- 
teres’s  of  the  ion-beam  sample  indicates  the  presence  of  some 
interface  states.  Quantification  of  the  interface  states  is  diffi¬ 
cult  due  to  the  large  leakage  current.  The  threshold  voltage 
shift  of  the  ion-beam  oxide  is  also  indicative  of  uncompensat¬ 
ed  damage  to  the  oxide  or  substrate.  The  behavior  of  the  ion- 
beam  oxide  in  inversion  is  easily  understood  from  the  I-V 
curves — the  deep  depletion  is  due  to  some  of  the  inversion 
layer  charge  leaking  away.  The  observed  rise  in  the  C-V 
characteristics  in  inversion  is  believed  due  to  the  coupling  of 
minority  carriers  from  surrounding  inverted  regions  under 
the  field  oxide. 

IV.  SUMMARY 

Low-energy  ion-beam  oxidation  at  room  temperature  has 
successfully  produced  ultrathin  films  of  silicon  dioxide  suit¬ 
able  for  use  in  MOS  devices.  The  obtained  film  composition 
is  comparable  to  that  of  thermally  grown  ultrathin  oxide 
films. 


BINDING  ENERGY  (*V) 

Fig  2  Angle-dependent  XPS  spectra  of  ion-beam  grown  oxide  The  Si  Ip 
peak  with  lower  binding  energy  (  —  99  eV),  which  disappears  near  grazing 
angle,  corresponds  to  the  substrate  The  peak  with  higher  binding  energy 
(  -  103  eV)  corresponds  to  the  oxide 
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LIGHT-GUIDED  ITCHING  TOR  III-V  SEMICONDUCTOR  DEVICE  FABRICATION 

Dragan  V.  PODLESNIK 

Microelectronic*  Science*  Laboratories  and  Center  for 
Teleconnunieaton*  Research,  Columbia  University,  New  York  City 
New  York  10027,  US. A. 

The  rapid,  ultraviolet-induced  aqueous  etching  produces  vertical, 
high  aspect  features  in  GaAs  samples  of  different  crystal 
orientations  Much  of  the  speed  and  anisotropy  of  the  etching  is 

attributed  to  the  formation  of  efficient  hollow,  optical  waveguides. 

These  guides  have  been  characterized  by  measuring  the  optical  loss 
and  the  field  distribution  within  the  guide  The  optical  loss  is 
typically  small  and  does  not  restrict  the  etching  of  deep  features 

1  INTRODUCTION  the  third  dimension  of  a  semi  conduct : r 

Rate  anisotropy ,  which  results  in  a  wafer  is  currently  being  investigated 

strong  spatial  directionality,  is  an  as  a  means  of  providing  additional 

important  characteristic  ;r.  many  semi-  electrical  interconnections ,  2  J  .  In 
conductor  processing  operations  In  particular.  the  interest  ir.  these 

recent  years.  the  work  with  the  eniso-  through- ws f er  Interconnections  stems 

tropic  etching  has  focused  on  its  use  from  three  basic  goals:  to  reduce  the 

for  the  machining  of  semiconductor  length  of  connections  between  devices 

materials.  Vertical,  high-aspect  featu-  for  faster  processing  rates;  to  reduce 

res  are  an  important  requiremsnt  in  the  the  interference  and  crosstalk  between 

fab  ricatior.  of  advancad  alectronic  and  interconnections;  and  to  reduce  the 

micromechanical  devices!  1|.  The  clcse  area  occupied  by  i nter connect i  cr.s  cr. 

spacinga  and  small  sizes  of  modern  the  surface  of  the  semiconductor  wafer 

Integrated  circuits  require  vertical 
etching  to  eliminate  the  undercutting 
of  ediacer.t  structures.  In  addition. 


An  ti ample  is  a  CaAs  FET  with  vis 

connections  through  the  substrate' 3’ 
Such  translator  has  higher  gsir  s' 
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microwave  frequencies,  higher  power 
density,  and  potentially  lower 

manufacturing  coata  than  a  conventional 
CaAa  fiald  effect  tranaiator. 

Finally,  anlaotropic  etching  has 
been  extensively  ueed  in  nonelectronic 
appllcationai 1 1 .  Even  almple  hole*  and 
groovea  etched  in  a  semiconductor  wafer 
(typically  aillcon)  can  be  utilised  for 
many  applications.  One  usage  la  the 
generation  of  high  precision  molds  for 
microminiature  structures.  Patterns 
etched  clear  through  the  wafer  can  be 
applied  in  the  area  of  ink  jet  printing 
technology.  In  particular,  the  hole  on 
the  bettom  of  the  wafer  is  used  as  an 
orifice,  typically  about  20  urn.  for  ar. 
ink  jet  stream | 1 )  . 

2  MECHANISMS  FOP  ANISOTROPY  IN  ETCHING 
Conventionally,  etching  anisotropy 
in  single-crystal  materials  is  achieved 
by  relying  on  a  crystal  plane  dependent 
process.  such  as  reaction  rates  in  wet 
chemical  etching.  This  crystal lographi - 
cally  sensitive  etching  has  been  suc¬ 
cessfully  used  to  produce  deep  vertical 
features,  but  its  utility  is  restricted 
by  the  requirement  of  specific  and 
limited  crystal  orientations.  Localised 
electrochemical  jet  etching  has  been 
also  used  to  generate  vertical  holes  in 
semiconductor  wafers 

When  etching  occurs  by  an  ion 


assisted  reaction,  etch  rate  anisotropy 
can  be  expected,  because  Iona  are  inci¬ 
dent  normal  to  the  wafer  surface.  When 
a  semiconductor  la  plasma  etched,  in¬ 
corporation  of  an  appropriate  gas  ad¬ 
ditive  results  in  the  formation  of  a 
passivating  film  that  prevents  side 
wsll  etching.  However,  a  perfectly  ver¬ 
tical,  through-wafer  feature  cannot  be 
made  with  ion  and  electron  beam  sources 
because  of  the  spreading  of  the  parti¬ 
cle  beam  and  loss  of  the  particle  flu- 
ence  with  increasing  feature  depth.  In 
addition,  the  massive  particle  bombard¬ 
ment  produces  typically  an  incurring 
damage  of  the  semiconductor  surface 

In  this  paper  we  will  show  that 
laser- induced  aqueous  etching  with  the 
ultraviolet  beam  can  be  used  tc  make 
deep,  high-aspect  features  irrespective 
of  crystal  orientation.  If  the  laser 
beam  is  Incident  normal  to  the 
semiconductor  surface,  it  could  be 
expected  that  the  1 i ght-ass: sted 
etching  will  occur  primarily  in  the 
direction  of  the  beam.  But.  unlike  ion 
or  electron  beams.  the  laser  beam  is 
guided  by  the  etched  structure  itself 
via  gl ancing- angle  reflections  thus 
resulting  in  the  vertical  etching  This 
waveguiding  effect  coupled  with  the 
rapid  etching  at  low  laser  intensities 
was  first  seen  in  the  ultraviolet  ( 2 S ~ 
nm)  induced  aqueous  etching  of  CaAs!4! 
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Bents,  the  laser  been  was  focused  euch  etched  structure  which  prevents  the 


that  the  bean  waist  waa  on  the  front  of 
the  seniconductor  wafer.  The  entrance 
of  the  structurea  Is  well  defined  and 
the  surrounding  area  is  undisturbed.  A 
dletinct  characteristic  of  these  fea¬ 
tures  is  their  smooth  and  vertical  side 
walls.  Their  width,  typically  2-3  um. 
remaina  virtually  constant,  independent 
of  the  etched  depth.  In  contraat  to 
this.  the  focused  laser  besm,  with  the 
3  um  waist  diverges  considerably  over 
the  corresponding  distancs,  as  shewn  in 
Fig.  3  .  The  measured  confotal  beam, 
parameter  is  only  25  um  as  compared, 
for  example,  to  the  perfectly  vertical 
etch  through  the  20?-um  •  tht  t  V.  wafer. 
This  clearly  shows  that  the  processing 
beam  is  confined  inside  the  hrllow. 


bean  fron  dlvarging. 

Because  the  vertical  features  are 
created  by  uelng  a  Causalan  beam,  one 
might  think  that  their  profiles  should 
also  appear  Gaussian.  The  vertical 
walla  is  a  strong  deviation  from  the 
Causslan  shape.  In  order  to  determine  a 
mechanism  for  the  formation  of  vertical 
waveguides,  we  monitored  the  develop¬ 
ment  of  holes,  as  ehown  lr.  F:  c  4 
Initially,  the  etch  profile  is  esser.- 
tiilly  identical  to  that  of  the  inci¬ 
dent  laser  beam.  as  shown  in  Fig  S 
However,  as  the  etch  depth  increases, 
the  feature  assumes  a  tubular.  no 
Csussiar.  profile.  F:c  5 

The  initial  formation  of  these 
anisotropic  features  from  a  focused 
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Fig. 3  Comparison  of  the  etched  profile 
with  free-spare  propagation  of  the 


laser  tear  with  a  bear  waist  of  4  ur 
car  be  circulated  with,  a  model  based  or. 
the  use  cf  ray  optics  The  model  inclu¬ 
des  the  very  important  angular  depen¬ 
dence  cf  reflectivity  ar.d  absorption, 
which,  for  the  case  of  a  lcasy,  metal 


1 1  ke 

material  must 

be  obtained  from  a 

modi  f 

:ed  version  of 

the  Fresr.el  equa- 

1 1  or.s 

In  this  case. 

the  surface  slope. 

whi  ch 

is  equal  to 

the  local  incident 

ac.p.e  cf  the  laser  beam,  is  giver,  by 
»  !i .  r,  t  )«tar.  1  ' dc ( t )  /dr ( t )  )  ( 1  ) 

where  i(t)  and  r(t)  are.  respectively, 
the  time-dependent  vertical  and  radial 
coordinates  of  the  surface  structure. 
The  two  components  of  the  local  inter¬ 
face  velocity  of  removal  are  given  by 
(dr  dt)=|kl(r)(l-R(8))coa(8) | co# ( 8  )  (2) 
(d:  dt)=  *i(r)(i-R(8))ccs(8)!Bir. ( 8  )  (3) 
where  I ( r  )  represents  the  incident  Gau¬ 
ssian  beam  and  (1  -  R(8)>  is  the  local 
absorption  The  interface  velocity,  in 


portions! .  b;  the  reaction  rate  cons¬ 
tant  k,  tc  the  local  absrrbed  light 
intensity  T.-.e  ccs(8)  term  ir.  Eqs  (2) 
and  (3)  neper- -"F  the  change  in  inci¬ 
dent  p~»er  de;  « ; t ed  on  the  surface  due 
to  surface  tilt  The  last  term  ir.  Eqs  . 
(2)  and  (3)  represents  the  directiona¬ 
lity  cf  etching  w.th  this  surface  tilt. 
Material  removal  is  always  perpendicu¬ 
lar  to  the  plane  cf  the  surface. 

'hiring  the  very  initial  stages  cf 
formation  of.  say.  a  hole,  with  a  Gaus¬ 
sian  laser  bear,  all  the  angular  depen¬ 
dent  terms  are  inconsequent! al  As  a 
result.  initially  the  aurface  should 
follow  the  Intensity  dr  atributior,  of 
the  laser  beam  Therefore  the  initial 
fee- -re  will  be  Gaussian. 

Ar  the  feature  walls  become  mere 
vertical.  the  angu 1 ar -dependent  terms 
mentioned  earlier  become  more  impor¬ 
tant  E  er.  the-.,  fer  ar.  incident  Gaus- 


braces  . 


is  assured  tc  be  linearly  pre- 


a:  ar. 


1  »  r 


bear,  the  surface  structure 


take*  •  non-Gsussian  ships  Bovtvtr, 
despite  this  non-Caueelan  affect,  the 
above  nodal  will  not  produce  the  strik¬ 
ing  features  with  vertical  walla  aeen 
here.  Many  of  these  effects  occur  be¬ 
cause  light  begins  to  be  trapped  inside 
and  undergoea  Internal  reflectiona  aa 
the  hole  deepena.  Owing  to  the  angle 
dependent  reflectivity,  there  ia  atill 
enough  power  reflected  off  ‘the  aide- 
wall  e  to  be  subsequently  absorbed  as 
the  light  again  strikes  a  different 
portion  of  the  surface.  This  first 
internal  reflection,  or  second  etrike. 
does  contribute  significantly  to  the 
formation  of  the  features  Figure  6 
shows  this  ccrputer  sir.laticr.  of  via 
hole  development  that  includes  internal 
reflections  tc  the  firet  order  These 
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Fig  fc  Calculated  hole  development  with 
(eolid  line)  and  without  (dashed  line! 
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bean  is  then  optically  guided  t.-r:_c 
the  hole,  while  vertical  etching  r  :  -  ■  _  ■ 
nues  at  the  bottom  Under  these  condi¬ 
tions,  etching  can  bee  properly  mideled 
as  a  self  propagating  waveguide 

In  fact,  much  of  the  speed  srd 
sr.isotropy  cf  the  etching  cm  be  attri¬ 
buted  to  the  properties  of  these  opti¬ 
cal  waveguide- 1  ike  features  lr.  parti¬ 
cular.  the  optical  loss  is  snail,  e.er 
for  200-  to  300-um- thick  samples,  and 
does  not  significantly  limit  the  etch 
rate  with  increasing  feature  depth 
Further,  the  continuation  of  etching 
even  in  the  heavily  solvated  1 i cv i d  a* 
the  bottom  of  a  deeply  etched  feat-re 
is  indicstive  of  the  active  chemistry 
induced  by  the  ultraviolet  light 
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5  OPTICAL  WAVEGUIDE  FORMATION 


On*  ef  th*  Boat  interesting 
consequence*  of  the  unusual  laser 
chemistry  described  above  Is  that  it 
permits  th*  formation  of  long  cylindri- 
cal  tube*  or  visa  in  semiconductor  wa- 
fere.  In  fact,  they  are  alniature,  hol¬ 
low  semiconductor  waveguides.  This  un¬ 
usual  mlcrostructur*  haa  suggested  a 
number  of  novel  application*  in  micro¬ 
electronic*  and  integrated  optic*,  in¬ 
cluding  vertical.  hi gh-der* i ty  optical 
fiber  interconnect* . I  9 ]  thr Cugh - w* f e r 
optical  light  guide*.  I  101  and  high  den¬ 
sity.  through  -  wafer  via*  for  electrical 
s  r.t  erccn-.ect  i  on*  from  the  front  tc  the 
b  a :  v  cf  * :  1 : cor.  or  CaA.s  integrated 
c  i  r cui t • : 3  i 

It  was  already  suggested  that  the 
guiding  cf  the  laser  bears  it  er.  impor¬ 
tant  factor  ir.  determining  vla-hcle 
profiles  and  etch  rate*  Th*  circular 
hollow  dielectric  and  metallic  wave¬ 
guides  have  beer.  there  'My  discussed 
by  Karcatili  and  Schmel trer1 1 1 1  Tc 
illustrate  some  of  the  feature*  of  the 
hollow  sen  conductor  waveguide.  we 
applied  a  similar  analyais  for  the  vis 
holes  etched  through  CaA*  sample*  The 
theory  and  the  experiment  showed  ex¬ 
cellent  agreement  Figure  1  show*  the 
attenuation  coefficient  of  the  hollow 
CaAs  waveguide  as  •  function  of  hole 
diameter  In  agreement  with  the  theo- 
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6  CON?!/'?  ' 
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ABSTRACT 


Light  enhanced  reaction  of  oxygen  with  gallium  arsenide 


surfaces  by  irradiation  with  deep-uv,  near-uv  and  visible 
light  was  studied  using  Auger  electron  spectroscopy  and  x-ray 
photoelectron  spectroscopy  for  submonolayer  and  above 
monolayer  regimes,  respectively.  The  onset  of  a  strong 
wavelength  dependence  of  the  enhanced  oxidation  was  observed 
after  the  oxygen  coverage  reached  more  than  half  a  monolayer. 
An  abrupt  threshold  for  this  wavelength  dependence  was  also 
observed  at  -  4.1  eV  photon  energy.  Photodissociation  of  O, ' 
(superoxide  ion)  formed  by  the  photogenerated  carriers  can 
explain  this  strong  wavelength  dependence. 


vwv 


I.  INTRODUCTION 


Photon  irradiation  of  semiconductors  can  nonthermally 
enhance  chemical  reactions  on  the  surface.  This  effect  has 
been  generally  attributed  to  the  photogeneration  of  carriers 
at  the  semiconductor  surface.1  We  have  observed  a  strong 
enhancement  of  oxidation  on  gallium  arsenide  surfaces  using 
deep-uv  photons  (248  nm,  257  nm),  whereas  only  moderate 
enhancement  was  seen  using  near-uv  to  visible  photons  (350  nm 
to  514  nm)  for  experiments  were  carried  out  either  in 
laboratory  air  ambient  or  in  water  solution.7  ■'  This  laser- 
enhanced  oxidation  can  be  used  for  low- temperature  oxide 
growth  and  is  potentially  useful  for  device  applications. 
Specifically,  laser  grown  oxides  have  been  used  in  our 
laboratory  to  increase  the  variation  of  Schottky  barrier 
heights  in  metal -GaAs  contacts.4 

In  this  paper  we  report  the  results  of  a  study  of  the 
light-enhanced  oxygen  reaction  with  GaAs  surfaces  both  below 
and  above  monolayer  coverage.  Experiments  were  performed 
inside  a  stainless  Bteel  chamber  on  clean,  well-defined 
surfaces  irradiated  with  low-intensity  excimer  lasers.  In 
other  experiments  we  observed  a  wavelength  threshold  for  the 
light-enhanced  oxidation  using  different  argon-ion  laser 
lines  under  laboratory  air  ambient. 


II.  EXPERIMENTAL 


A  two-chamber  ultrahigh  vacuum  system  consisting  of  a 
surface  analysis  chamber  and  a  sample  preparation  chamber 
separated  by  a  differentially  pumped  load-lock  seal  was  used 
in  this  study.  Gallium  arsenide  surfaces  of  both  (100)  and 
(110)  orientation  doped  with  Si  to  n  =  101 8  /cm5  were  used  in 
the  experiments.  Samples  were  cleaned  with 
trichloroethylene,  acetone,  methanol,  deionized  water,  50% 
NH„0H,  and  then  blown  dry  with  nitrogen.  For  in  situ 
experiments,  the  sample  was  subsequently  placed  in  the 
analysis  chamber  and  subjected  to  mild  argon  ion  bombardment 
(500  eV,  1  pA/cmJ  )  to  remove  the  residual  oxide,  and  then 
annealed  at  540  C  for  10  min.  The  sample  was  then  moved  into 
the  preparation  chamber  filled  with  oxygen  at  a  selected 
pressure.  Through  a  quartz  window  a  laser  beam  was 
introduced  into  the  chamber  and  irradiated  the  sample  for 
1000  sec.  After  this  process  the  sample  was  moved  back  to 
the  analysis  chamber  for  surface  analysis.  An  excimer  laser 
was  used  to  irradiate  the  sample  in  the  preparation  chamber 
with  either  the  KrF  line  at  248  nm  (1  mJ/cm7  ,  100  Hz)  or  the 
XeF  line  at  351  nm  (0.2  mJ/cmJ  ,  500  Hz).  At  this  low 
intensity  the  transient  temperature  rise  on  the  surface  is 
estimated  to  be  below  10  C.‘  Some  of  the  experiments  were 
performed  ex  situ  with  an  argon-ion  laser  tuned  to  different 
wavelengths  at  a  power  density  of  50  mW/cm7  .  For  these 
experiments  samples  were  irradiated  by  the  laser  under 
laboratory  air  ambient  after  chemical  cleaning. 


X-ray  photoelectron  spectroscope  (XPS)  was  used  to  study 


the  oxides  formed  on  the  surface.  A1  (1486.6  eV) 

radiation  was  used  to  excite  the  2ps , ,  and  3d  core-level 
emission  for  both  gallium  and  arsenic.  By  making  use  of  the 
large  binding  energy  shift  of  the  oxide  arsenic  atoms 
relative  to  the  substrate  arsenic  atoms,  the  ratio  of  the 

oxide  component  to  the  substrate  component  can  be  obtained  to 

•  # 

estimate  thicknesses  between  1  A  to  50  A  as  reported 
previously.3  For  submonolayer  coverage  (>  0.1  ML)  of 

chemisorbed  oxygen.  Auger  electron  spectroscopy  (AES)  was 
used  to  study  the  oxygen  coverage  for  its  higher  sensitivity. 
By  taking  the  ratio  of  the  Auger  peak-to-peak  intensities  of 
O(KLL)  to  Ga(LMM)  and  using  a  layer  model, *  the  fractional 
coverage  of  the  oxygen  on  the  surface  can  be  determined  up  to 
a  full  monolayer.  Care  has  been  taken  to  perform  the  AES  on 
a  spo t  previously  untouched  by  the  electron  beam  after  an 
ion-bombardment- anneal ing  cycle  to  avoid  electron  beam 
effects.7  The  pressure  of  the  analysis  chamber  was  kept 
below  5x10' 10  Torr  for  XPS  and  5x10"  ’  Torr  for  AES. 

III.  RESULTS 

The  GaAs(llO)  surface  was  used  here  to  study  the 
enhanced  oxidation  by  laser  irradiation  because  it  provides  a 
uniquely  defined  surface  after  an  ion-bombardment-annealing 
cycle.*  Results  of  this  study  using  XPS  are  shown  in  Fig.  1. 
Samples  were  cleaned  by  the  ion-bombardment- annea 1 ing  process 
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and  then  irradiated  by  laser  light  in  the  preparation  chamber 
filled  with  100  Torr  dry  oxygen.  Only  As2p3  ,  :  peaks  are 
shown  because  the  oxide  and  substrate  components  are  well 
separated.  The  upper  spectrum  is  of  the  surface  irradiated 
with  248-nm  light,  the  middle  one  with  351-nm  light,  and  the 
lower  one  without  any  light.  Without  light  irradiation  only 
1.9  A  oxide  was  formed  (less  than  one  monolayer).  With 

irradiation  of  351-nm  light,  2.6  A  oxide  was  seen--a  modest 

0 

enhancement.  With  248-nm  irradiation,  9.6  A  oxide  was  grown 
under  the  same  condition.  This  strong  enhancement  of 
oxidation  by  deep-uv  light  agrees  with  our  previous  results2 
conducted  in  laboratory  air  ambient  on  GaAs(lOO)  surfaces. 

Figure  2  shows  the  results  of  the  AES  study  of  the 
oxygen  coverage  on  the  GaAs(llO)  surface  as  a  function  of 
oxygen  exposure  with  simultaneous  irradiation  of  either  248- 
nm  or  351-nm  laser  light,  or  without  any  light  irradiation. 
At  lower  coverage  the  enhancement  of  the  oxygen  chemisorption 
is  moderate  and  comparable  for  either  light  source  within 
experimental  uncertainties.  However  as  the  coverage 
approaches  -  0.7  monolayer,  the  enhancement  of  oxidation  by 
the  248-nm  light  is  seen  to  increase  rapidly,  while  the 
enhancement  by  the  351-nm  light  remains  moderate  This  onset 
of  strong  wavelength  dependence  is  also  seen  in  the  results 
obtained  from  the  GaAs(lOO)  surface  as  shown  in  Figure  3  in 
which  the  rapid  increase  of  enhancement  of  248-nm.  light 
occurs  at  ■  C  L  monolayer  coverage.  Also  noted  is  that  on 
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the  (110)  surface  the  oxygen  coverage  is  smaller  than  that  on 
the  (100)  surface  for  low  oxygen  exposure.  Indicating  a  lower 
Initial  sticking  coefficient  on  the  (110)  surface. 

Figure  4  shows  the  results  of  oxide  growth  as  a  function 
of  photon  energy  using  an  argon-ion  laser  to  irradiate 
GaAs(lOO)  surfaces  in  laboratory  air  ambient  for  20  min  with 
each  of  the  selected  wavelengths.  The  dashed  line  at  7.5  A 
represents  the  natural  oxide  grown  on  the  surface  in  the 
course  of  the  experiment  as  determined  from  the  control 
samples.  The  oxide  thicknesses  due  to  light  irradiation  show 
a  very  sharp  contrast  for  the  oxidation  enhancement.  For 
photon  energies  above  -  4  eV  the  oxidation  was  about  five 
times  faster  than  for  photon  energies  below  4  eV. 
Specifically,  oxide  growth  for  275-nm  and  257-nm  argon-ion 
laser  lines  was  much  more  strongly  enhanced  than  that  for 
307-nm,  350-nm,  454-nm  and  514-nm  laser  lines,  respectively. 
This  agrees  very  well  with  the  results  from  the  excimer  laser 
study  performed  in  the  vacuum  chamber,  as  shown  in  Fig.  1, 
where  the  248-nm  laser  light  shows  a  much  stronger 
enhancement  than  the  351-nm  light. 

IV.  DISCUSSION  AND  CONCLUSION 

The  sharp  increase  of  oxidation  enhancement  at  about  4 
eV  photon  energy  indicates  a  threshold  phenomenon  in  the 
reaction.  This  threshold  is  close  to  the  dissociation  limit 
of  Oj ‘  (sureroxide  ion),  which  has  been  studied 
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theoretically*  and  experimentally  from  the  vibrational 
states*  to  be  4.1  eV.  Oa '  ha6  been  known  to  be  a 
relatively  stable  ionic  species  in  the  gas  phase  from 
experiments  of  low-energy  electron  attachment  to  oxygen 
molecules10.  The  presence  of  0a *  on  the  surface  is  thought 
to  be  the  result  of  the  attachment  of  photogenerated 
electrons  to  the  physisorbed  molecular  oxygen  on  the  surface 
and  was  suggested  for  the  light-enhanced  oxidation  of  both 
GaAs1 1  and  Si11  surfaces.  While  the  photodissociation  of  02 ' 
is  not  known  to  have  been  performed  experimentally,  it  has 
been  suggested  theoretically*.  The  photodissociation  of  02 ' 
through  the  A  -  X  transition  is  a  much  more  likely  event  than 
that  of  the  0,  because  of  the  selection  rule  restraint  due  to 
reflection  symmetry  is  removed  for  the  02  Dissociation 
of  02  ‘  will  generate  very  reactive  atomic  oxygen  species  and 
thus  cause  a  strong  enhancement  effect  in  the  oxide  growth. 

The  Btrong  wavelength  dependence  of  the  light-enhanced 
oxidation,  however,  was  not  observed  below  about  half  a 
monolayer  of  oxygen  coverage,  as  shown  in  Figs.  2  and  3, 
suggesting  that  a  different  mechanism  may  prevail  in  this 
regime.  Bertness  et  al.  have  measured  the  activation  energy 
of  oxygen  chemi Borption  on  the  GaAs (110)  with  and  without 
light  irradiation  below  monolayer  coverage  : 4  They  found  the 
activation  energy  to  be  0.5  eV  for  dark  oxidation  and  null 
under  514-nm  light  irradiation.  It  is  thought,  therefore, 
that  under  light  irradiation  (rf  above-band- gap  photon 


energy),  physisorbed  oxygens  will  become  dissociatively 

/ 

chemisorbed  immediately  after  the  attachment  of 
photogenerated  electrons  as  long  as  there  are  available  sites 
for  chemisorption.  This  can  explain  the  moderate  and 
comparable  enhancement  of  oxidation  by  both  248-nm  and  351-nm 
photons  at  lower  coverage  because  the  photogenerated 
electrons  are  nearly  constant  in  energy  over  this  range  of 
wavelength . 3 

Monch  has  developed  a  simple  model  for  the  saturation 
coverage  of  oxygen  on  GaAs(llO).11  By  assuming  that 
dissociatively  chemisorbed  oxygen  will  be  incorporated  into 
two  adjacent  bonds  in  the  surface  Ga-As  zig-zag  chain,  and 
that  a  repulsive  interaction  occurs  between  two  pairs  of 
chemisorbed  oxygen  along  the  chain,  so  they  have  to  be 
separated  at  least  by  one  unincorporated  Ga-As  bond,  he 
arrived  at  a  saturation  coverage  of  -56%  of  a  monolayer. 
Beyond  this  coverage,  further  oxidation  will  occur  by  the 
diffusion  of  oxygen  species  through  the  first  layer.  We 
think  that  0, '  would  then  form  in  the  oxide  by  the  attachment 
of  ejected  photogenerated  electrons  to  the  oxygen  molecules. 
Photodissociation  of  this  species  will  cause  a  much  stronger 
enhancement  of  oxidation  when  above- threshold  photons  are 
used.  While  a  direct  experimental  proof  of  the  presence  of 
O, "  and  its  photodissociation  is  yet  to  be  made,  this 


mechanism  offers  a  simple  explanation  for  the  strong 
wavelength  deper.der.re  of  the  oxidation  enhancement.  The 


results  of  this  study  suggest  that  the  effect  of  the 
nonthe ratal! zed  photogenerated  carriers*  seems  to  play  a  small 
role  in  the  strong  enhancement  of  oxidation  by  deep-uv 
photons  as  compared  to  the  apparent  photodissociation  of  0,  *  . 

In  summary,  light-enhanced  oxidation  of  GaAs  has  been 
studied  with  visible  to  deep-uv  laser  lines.  The  strong 
enhancement  of  oxidation  by  photons  of  energy  higher  than  4.1 
eV  can  be  explained  as  the  result  of  the  photodissociation  of 
superoxide  ion  (Oj'  )  formed  by  the  attachment  of 
photogenerated  electrons  to  the  oxygen  molecules  on  or  at  the 
surface . 
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FIGURE  CAPTIONS 

Figure  1.  Comparison  of  the  oxidation  of  GaAs(llO)  with  or 
without  laser  irradiation.  Shown  are  the  x-ray 
photoelectron  spectra  of  As2ps , ,  level. 

Figure  2.  AES  study  of  the  oxidation  of  GaAs(llO)  as  a 
function  of  oxygen  exposure  with  irradiation  of 
248-nm  or  351-nm  laser  light  or  without  any  light 
irradiation.  Data  points  beyond  one  monolayer  are 
from  XFS  results. 
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Figure  3.  AES  study  of  the  oxidation  of  GaAs(lOO)  as  a 
function  of  oxygen  exposure  with  irradiation  of 
248-nm  or  351-nm  laser  light  or  without  any  light 
irradiation.  Data  points  beyond  one  monolayer  are 
from  XPS  results. 

Figure  4.  Light  enhanced  oxide  growth  plotted  as  a  function 
of  irradiating  photon  energy.  Dashed  line 

represents  the  natural  oxide  grown  on  the  surface 
in  the  laboratory  air  ambient. 
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AN  IMPROVED  DIFFERENTIAL  VOLTAGE 
TECHNIQUE  FOR  CAPACITANCE  MEASUREMENT 

X.  Wu,  H.  L.  Evans  and  E.  S.  Yang 
Columbia  University,  New  York.  NY  10027.  U.S.A. 

( Received  20  March  1987;  in  revised  form  21  July  1987) 

Abstract— A  differential  voltage  capacitance  technique  is  described  for  measuring  the  capacitance  of 
forward-biased  Schottky  diodes.  This  technique  is  based  on  the  concept  of  an  improved  admittance  bridge. 
Difficulties  arising  from  the  high  conductive  component  were  solved  or  minimized.  Dau  were  obuined 
for  NL-Si  and  Pd-GaAs  Schottky  diodes  with  quality  factors  as  low  as  0.001 .  Compared  to  accurate  phase 
capacitance  spectroscopy  and  other  bridge  methods,  this  technique  is  more  reliable  and  easier  to  operate. 


Admittance  spectroscopy  is  a  valuable  method  for 
investigation  of  semiconductor  interface  states.  In  the 
last  twenty  years,  conductance  and  capacitance  tech¬ 
niques  have  been  widely  used  for  studying  MOS 
devices.  However,  when  a  similar  method  is  applied 
to  measure  the  capacitance  of  forward-biased  metal- 
semiconducior  contacts,  large  errors  occur.  This  is 
because  the  capacitance  signal  of  the  diode  is  being 
overwhelmed  by  the  thermionic  emission  conduct¬ 
ance  (G),  This  conductance  is  sometimes  three  or 
four  orders  of  magnitude  larger  than  the  capacitance 
(tuC).  Under  this  extreme  condition,  ordinary 
methods,  such  as  the  use  of  a  Boonton  capacitance 
meter,  are  no  longer  applicable.  Even  the  accurate 
phase  capacitance  spectroscopy  (APCS)  method  de¬ 
scribed  previously!  1]  is  limned  by  its  instrumentation. 
First  of  all.  the  phase  setting  in  APCS  becomes 
extremely  crucial.  In  pracuce.  for  a  small  phase  error. 
Aa  (around  90"),  we  have: 

A(o>C)  »■  —  GAdi.  (1) 

where  A(<uC)  represents  the  resultant  error  of  the 
measured  susceptance  and  A<h  is  in  radians.  For 
G  uiC  on  the  order  of  Iff*,  even  a  10"*  error  in  phase 
would  cause  a  100%  error  in  the  capacitance. 
Although  the  APCS  method  can  set  the  phase  to  a 
very  high  accuracy  (about  0.01°),  it  cannot  ensure 
that  there  will  be  no  phase  shift  during  the  measure¬ 
ment.  Therefore,  no  matter  how  accurately  the  phase 
is  set  initially,  a  unv  phase  drift  will  nullify  the  whole 
measurement.  Furthermore,  the  sensitivity  of 
the  instruments,  e.g.  the  lock-in  amplifier  (LIA).  is 
limned  by  the  conductance  signal  level.  We  know  that 
the  capacitance  signal  of  a  diode  is  very  weak, 
especially  at  low  frequencies.  It  is  desirable  to  use  a 
high  sensitivity  setting  in  order  to  get  better  noise 
performance.  But  this  would  overload  the  instru¬ 
ments  because  the  capacitance  signal  is  buried  in  a 
huge  conductive  comoonent.  The  large  in-phase  sig¬ 
nal  frequentlv  drives  the  LIA  into  a  nonlinear  region, 
making  the  measurement  imoossible.  For  this  reason, 
we  have  implemented  the  following  di^renual- 


voltage  scheme  for  measuring  device  capacitance 
when  the  quality  factor  (to C/G)  is  as  low  as  0.001. 

The  basic  concept  of  the  differential  voltage 
capacitance  spectroscopy  (DVCS)  is  similar  to  that  of 
a  bridge  circuit.  Since  the  mam  problem  arises  from 
the  high  conductance  that  is  inherently  associated 
with  the  device,  we  propose  to  generate  an  in-phase 
signal  to  cancel  out  the  conductive  component  of  the 
diode,  letting  only  the  capacitive  component  be  re¬ 
tained  to  pass  through  the  phase  sensitive  detector. 
This  way,  the  overload  problem  is  avoided  and  the 
influence  of  the  phase  dnft  is  also  minimized.  As  a 
result  the  accuracy  of  the  measurement  is  greatly 
improved.  A  similar  bridge  method  was  formerly 
suggested  by  Barret  and  Vapaille[2]  and  later 
modified  by  GrevefB].  Barret  and  Vapaille's  method, 
using  an  equal-armed  bridge,  is  tedious  and  time 
consuming  since  an  adjustment  of  temperature  for 
bridge  balance  is  required  at  each  data  point.  Grei  e's 
method,  on  the  other  hand,  needs  a  floating  a.c. 
signal  source  and  a  lock-in  amplifier  with  both  in- 
phase  and  quadrature  outputs.  The  floating  signal 
source  readiiy  picks  up  extra  noise.  Moreover,  the 
biasing  voitage  m  his  bridge  is  applied  to  the  diode 
through  a  current  sensitive  preamplifier,  which  ts 
supposed  for  detecting  the  small  a.c.  signal.  The 
non-separation  of  the  ax.  and  the  d.c.  paths  results 
in  an  overload  of  the  current  preamplifier. 

A  schemauc  description  of  the  differential  voltage 
capacitance  spectroscopy  technique  is  shown  in  Fig. 
1.  A  small  ax.  reference  signal.  If,  derived  from  the 
internal  oscillator  of  a  PAR  124  lock-in  amplifier,  is 
superimposed  on  a  biasing  voltage.  This  combined 
voitage  is  applied  between  point  D  and  the  ground. 
Two  currents  are  generated.  One.  i,,  passes  through 
the  diode,  the  other.  i„,  through  a  variable  resistor.  R 
These  currents  are  converted  into  voltage  signals  by 
resistors  R'.  and  are  then  fed  into  the  inputs  of  the 
LIA  through  a  differential  preamplifier  The  output 
of  the  differenual  preamplifier  contains  onlv  the  a.c. 
component  of  f,  -  t's.  the  d.c  ponton  is  rejected.  If 
R  is  much  smaller  compared  to  I  G  and  R.  the 
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output  of  the  LLA  is: 

y0(4»  »  V  R' A  [(C  —  1  //?  )cos  0  +  wC  sin  d>],  (2) 

where  A  is  the  voltage  gain  of  the  LIA  and  <p  the 
phase  setting  of  the  phase  sensitive  detector.  When 
the  phase  is  set  to  90'  with  a  small  error  A$,  the 
output  becomes: 

K0-(90°+Ad>)-  VIR'A[  —  (C  —  \/R)A<p  +  cuC]. 

(3) 

and  the  error  is: 

A(o>C)-  -{G  -  UR)A<t>.  (4) 

If  the  conductive  component  is  nulled  out.  i.e. 
C  -  MR  a  0.  the  error  will  be  very  small  even  in  the 
presence  of  a  significant  A<t>.  This  can  be  contrasted 
with  eqn.  1  where  A(wC)  is  directly  proportional  to 
G\<t>=  * 

The  measurement  procedure  is  as  follows.  For 
each  given  frequency,  the  phase  of  the  LIA  has  to  be 
set  at  90'  first.  This  is  accomplished  by  seeking  a 
phase  position  of  zero-output  when  alone  is  being 
measured.  The  resistance  of  R  may  be  chosen  to  be 
any  value  comparable  to  1/G  of  the  diode.  While 
keeping  the  phase-range  switch  at  90',  the  phase 
vernier  is  varied  until  the  output  of  the  LIA  is  zero. 
Since  Fb  is  in  phase  with  the  reference  signal,  the 
preceding  adjustment  ensures  that  the  phase  is  set  to 
measure  only  the  capacitive  component.  The  nest 
step  is  to  balance  the  conductive  component.  For  this 
purpose,  we  turn  the  phase-range  switch  to  0°  (with¬ 
out  touching  the  phase  vernier)  and  the  input  mode 
to  A  —  B  so  that  the  quanuty  <7  —  1/7?  is  examined. 
The  variable  resistor  R  is  then  adjusted  until  a  zero 
output  ts  reached  Turning  the  phase-range  switch 
back  to  90'.  the  output  of  the  LIA  is  now  entirely  due 
to  the  capacitance  of  the  diode,  i.e. 

y„-  VR  AloC.  (5) 

The  data  are  then  collected  by  a  plotter. 

All  the  resistors  used  in  the  circuit  should  have 
small  stray  capacitance  and  inductance.  The  whole 
setup  should  be  designed  with  care  to  keep  the  stra> 


capacitance  as  small  as  possible,  especially  at  nodes 
A  and  B.  In  our  experiment,  for  example,  the  sampie 
holder,  which  was  situated  in  a  cryogenics  coldhead 
to  obtain  various  temperatures,  formerly  was  con¬ 
nected  to  node  A.  Between  the  sample  holder  and  the 
cold-plate  there  was  a  thin  insulaung  sheet  for  ther¬ 
mal  conduction.  This  configuration  introduced  an 
extra  capacitance  at  node  A  in  respect  to  the  ground. 
A  rational  arrangement  is  to  put  the  sample  holder 
in  connection  with  node  D  and  let  A  connect  to  the 
probe  needle.  This  reduces  the  stray  capacitance 
significantly.  The  stray  capacitance  between  node  D 
and  ground  is  less  important,  since  it  is  parallel  with 
the  signal  source. 

One  might  use  a  voltage-controlled  resistor  or  a 
photocell  in  place  of  the  variable  resistor  to  automate 
the  balancing  procedure[3).  However,  it  requires  a 
lock-in  amplifier  with  dual-phase  output,  and  a  few 
seconds  delay  at  each  data  point  to  get  a  stable 
reading.  A  continuous  voltage  scan  is  difficult  to 
achieve.  Moreover,  since  the  measurement  is  sensitive 
to  stray  phase  shifts,  it  is  desirable  to  keep  all  the 
circuit  elements  simple.  In  this  circuit,  a  1  kfl  and  a 
50  CL  10-rum  potentiometer  in  senes  were  used  as 
coarse  and  fine  adjustments.  Cables  and  connectors 
were  kept  short  to  minimize  parasitic  inductance  and 
capacitance. 

The  error  caused  by  the  expenmen  til  setup  can  be 
easily  estimated  by  replacing  the  diode  with  a  resistor 
corresponding  to  the  highest  conductance  of 
the  device.  Under  this  situauon.  any  susceptance 
measured  is  due  to  the  setup.  The  stray  capacitance 
was  found  to  be  less  than  80  pf.  which  is  about  I  % 
of  the  measured  device  capacitance  at  low  frequency 
( ~  10  nf  at  100  Hz).  The  stray  caoacuance  thus  deter¬ 
mined  can  be  subtracted  from  the  data  if  higher 
accuracy  is  desired.  At  high  frequencies,  the  relative 
error  may  increase  for  the  device  capacitance  gets 
smaller  Moreover,  it  was  found  that  at  frequencies 
above  30  kHz.  the  cable  inductance  became 
significant  and  our  setup  was  not  suitable  for  these 
measurements.  The  a.c.  stgnat  level  used  m  the  experi¬ 
ment  snould  be  kepi  smaller  than  kT  q  \  large  a.c 


::  ^  ... ... ... 


V- 

u 


a 

.V 

$ 

•s 


,*.n 

■V 


■  ■■N 


/. 

v 


Fig.  2.  The  susceptanee  (id O  and  conductance  ( G )  curves  measured  from  a  Ni-nSi  Schotlky  diode  at 
108  Hz  and  300  K.  Note  the  scale  of  the  conductance  is  1000  tunes  that  of  the  capacitance. 


signal  will  yield  many  high  order  harmonics  because 
of  the  nonlinearity  of  diode  l-V  characteristics. 
However,  a  relatively  large  reference  signal  would  be 
better  for  noise  considerations.  An  optimum  signal 
level  depends  on  tradeoffs  between  the  linearity  and 
the  measurement  sensitivity.  In  our  experiment  best 
results  were  achieved  around  a  10  mV  rms  signal 
level,  where  both  conditions  seemed  to  be  satisfied. 
The  time  constant  of  the  LIA  can  be  chosen  accord¬ 
ing  to  the  noise  situation.  An  appropriate  time  con¬ 
stant  (1  s  in  our  case)  should  provide  an  adequate 
signal  to  noise  ratio  and.  at  the  same  time,  an  efficient 
measurement.  This  depends  on  the  frequency:  usually 
the  lower  the  frequency  the  longer  the  lime  constant 
required. 

The  method  was  used  to  measure  the  forward- 
biased  capacitance  of  a  Ni-nSi  diode[4.5].  Capaci¬ 
tance  vs  voltage  data  at  various  frequencies  and 
temperatures  were  obtained.  Figure  2  shows  a  typical 
result  measured  at  108  Hz  and  300  K.  The  measure¬ 


ment  was  repeated  several  times  over  a  period  of  a 
month  and  yielded  quite  reproducible  results  (r-0/»)- 
Though  the  PAR  124  is  an  old  model  of  Lla.  its  phase 
drift  is  small.  In  normal  operation  the  Maximum 
phase-fluctuation  of  this  model  turns  out  to  be  less 
than  0.01“.  This  phase  stability  was  found  particu¬ 
larly  valuable  to  our  measurements.  Once  the  phase 
setting  was  done  at  a  given  frequency,  the  tempera¬ 
ture  and  voluge  scans  could  be  made  without  any 
readjustment.  It  is  to  be  mentioned  that  the  phase 
resolution  of  most  commercial  look-in  amplifiers  is 
around  0.05°.  This  resolution  is  not  adequate  for  an 
accurate  phase  setting  as  required  in  APCS  method. 
However,  this  will  not  cause  problem  in  DVCS  since 
the  in-phase  component  was  balanced.  Moreover,  the 
phase  resolution  can  be  improved  by  a  frequency 
tuning.  We  had  noticed  that  the  phase  setting  is 
sl’ghtly  dependent  on  frequency.  It  suggests  that 
instead  of  using  the  phase-dial,  the  fine  phase  adjust¬ 
ability  could  be  acquired  by  tuning  the  reference 


ri 


"j"  'J'  'j* 


m 


frequency.  Thjs  method  was  utilized  to  help  set  the 
phase  with  a  resolution  beyond  that  provided  by  the 
phase  vernier.  Ln  order  to  examine  the  significance  of 
the  phase  error,  we  deliberately  turned  the  phase  1° 
off  its  correct  position  With  the  real  component 
carefully  nulled  out.  we  found  the  influence  on  the 
output  reading  to  be  less  than  3%.  while  the  phase 
accuracy  in  the  experiment  could  be  kept  to  less  than 
0.05" 

In  Fig.  2,  data  obtained  from  the  APCS  are  also 
presented  as  a  comparison.  It  can  be  seen  that  the 
results  of  the  DVCS  and  the  APCS  are  in  good 
agreement  at  low  bias.  At  high  bias  the  deviation 
between  the  two  is  caused  by  a  phase  error.  This  was 
confirmed  by  a  phase  readjustment  tn  the  APCS 
measurement,  such  that  a  0.005 ;  phase  shift  could 
cause  the  curve  to  coincide  with  the  DVCS  results. 
Since  the  DVCS  is  much  less  sensitive  to  the  phase 
error,  we  believe  that  this  method  provides  more 
reliable  data.  Figure  3  is  another  example  of  data 
obtained  from  a  Pd-rrGaAs  diode,  where  the  con¬ 
ductance  is  so  high  (30  mmho)  that  overloading 
becomes  a  serious  problem  in  APCS.  But  with 
DVCS.  the  capacitance  can  be  measured  without 


difficulty.  It  might  be  worth  mentioning  that  this 
technique  can  also  be  used  for  a>C  »G.  In  that  case, 
one  merely  puts  a  variable  capacitor  m  place  of  the 
variable  resistor,  where  the  capacitive  component  is 
to  be  cancelled.  This  could  be  useful  in  the  measure¬ 
ment  of  conductance  in  MIS  devices. 
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The  Effect  of  a  Thin  UV-Grown  Oxide  on  Metal-GaAs  Contacts 
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Photochemistry  ha'  been  used  to  grow  very  thin  oxide  laser-  I  '  •  1(1  A  )  on 
GaA'  (100)  surface'  Some  of  these  oxides  have  hern  annealed  t('  produce  Ga 
rich  oxide  surface  layers  Flectrical  characterization  of  meta’  contacts 
■  deposited  on  these  surfaces  show  a  much  greater  variation  of  Schottkv  barrier 

!  height'  for  the  oxidized  surfaces,  cspccialh  for  the  Ga-rich  oxidv  surfaces 

i  The  observed  variation  i'  25%  of  the  GaAs  bandca;  for  Ga-rich  oxide 

1  surface',  compared  to  8%  for  clean  surfaces  Thi  change  in  bamcr  height' 

.  is  toward  the  ideal  Schotlkv  limit  for  contact1  without  interfaa  states 

I  Thi'  behavior  sugpc't'  that  the  oxidized  surface  ha'  reduced  the  densitv  of 

|  interface  state'  present  in  the  mctal-GaV  contacts 
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1  Introduction 


Schottky  barriers  at  metal-semiconductor  contacts  have  been  the  subject  of 
extensive  investigation  for  several  decades.  The  current  understanding  indicates 
that  there  are  at  least  two  main  components  which  determine  a  Schottky  barrier 
height  If  an  interface  contains  a  high  density  of  electronic  state"-,  this  will 
dominate  the  barrier  formation,  allowing  differences  in  the  metal-semiconductor 
wort  function  to  piny  only  a  small  role  in  determining  the  magnitude  of  a  Schottky 
barrier'  This  appears  to  be  the  case  for  the  GaA"  (lOOi  surface  where  Schottky 
bamcr  heights  correlate  only  weakly  with  the  work  function  of  tin  meia!  If  the 
GaAs  surface  can  be  modified  to  reduce  the  density  r»f  interfile.  slates  when  a 
contact  is  formed,  the  Schottky  barrici  which  results  would  be  more  influenced  by 
the  metal  work  function  Chemistry  at  a  semiconductor  surface  i'  believed  to  play 

an  important  role  in  determining  the  Schottky  barrici  height  of  a  metal- 
semiconductor  contact  In  this  work  we  have  used  photochemistr-  to  aller  the  GaAs 
(lOfn  surface  through  the  reaction  of  oxygen  under  deep  1 A  illumination  pnor  to 
metal  deposition  and  have  investigated  the  electrical  characteristics  of  the 

resulting  contacts 

Previous  studies  of  the  effect  of  thin  oxide  layers  on  the  electrical  properties 
of  Schottky  contacts  found  that  for  a  variety  of  metal'  lb:  Schottky  barrier 

increase'  In  those  studies  the  oxide  layer  was  formed  b\  prolonged  exposure  to 
wet  oxvgrn^  or  h\  an  aqueous  chemical  reaction  ^  Oxides  formed  on  GaAs  m  moist 
ambient"  arc  different  from  those  grown  in  dr\  otipn-  piobabh  due  to  the 

incorporation  of  hvdroxy!  group'  in  the  oxide  4  Thvrcfori  ih  effect  of  the 

oxide'  used  in  our  stud,  formed  in  dry  oxygen  would  K  exp  -led  n  K  difTerent 
from  the  prcviou'  stud-e'  In  the  previous  studi.  s  trapped  of  chare  at  the 
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interface  is  believed  responsible  for  the  observed  increase  in  barrier  height 

Chang,  et  ah'*  have  suggested  that  a  suitably  chosen  chemical  reaction  could  allow 

the  formation  of  an  oxide  on  GaAs  which  would  not  cause  the  creation  of  charge 

traps 

Wc  have  reported  that  deep  UV  light  greatly  enhances  the  rate  of  oxidation  of 
GaAs  surfaces  compared  to  dark  oxidation  or  oxidation  with  near  lTV  light  exposure*1 
The  mechanism  responsible  for  this  enhancement  is  not  completely  understood,  but  it 
is  gcncralh  agreed  that  thr  enhancement  is  at  least  in  pan  do.  to  a  carrier 

related  eflect7-^  Q  Our  recent  study  indicates  that  the  deep  1  "V  enhancement  seems 
to  involve  the  photodissnciaticn  of  an  O2’  species  formed  b\  photogcncratcd 

earners  '  Thin  oxide  layrrs  formed  in  this  way  have  been  found  to  be 

stoichiometric  with  respect  to  Ga  and  As  Oxide  layers  which  ar.  non<toirhiometric 
have  also  been  formed  by  annealing  after  oxide  growth  This  result'  in  a  surface 
which  is  Ga-oxidc  rich,  as  will  be  shown  later  Metal  contacts  were  deposited  on 
both  stoichiometric  oxide  and  Ga-oxidc  rich  surfaces  Thr  variation  in  Schottky 
harrier  height  for  various  meta!  contacts  formed  on  oxidi/cd  surface-  wa<  found  to 
depend  much  more  strong''  on  the  work  function  of  the  metal  overlay er  than  is 

observed  on  clear,  GaAs  surface'  It  is  concluded  that  a  mor.  ideal  relationship 
between  metal  work  function  and  Schottky  barrier  height  exist'  for  the  oxidized 

surface  than  occurs  for  clean  GaAs  surfaces 

II  Fxpcrimrnt 

The  experiment'  in  thi'  study  were  performed  in  a  two  charr.Kt  lilV  surface 
analssi'  system  which  allowed  l  'V  light  exposure  of  the  sampk  and  ir.  situ  metal 

Meta’  deposition  wal  from  an  electron  beam  evaporation 


deposition 


source  where 


the  source  to  sample  distance  is  30  cm.  The  GaAs  used  in  this  study  was  (100) 
oriented,  nominally  undoped  with  n m  1x10^  cm"’*.  AH  samples  were  degreased  by 
sequentially  soaking  in  hot  trichlorethylene,  acetone,  and  methanol  They  were 
then  chemically  etched  for  IS  seconds  in  (1:1:10),  oxidized  in 

(10.1.1)  for  1  minute,  quenched  in  10^ -ft. -cm  water,  and  blown  dry 

with  N2  A  sample  was  then  mounted  on  a  Mo  plate  with  melted  In  and  immediately 

inserted  into  the  UHV  system,  where  it  was  heated  in  vacuum  to  550  *C  for  5 
minutes  This  desorbed  the  oxide  layer  formed  during  cleaning  and  served  to  anneal 
the  In  on  the  back  of  the  sample,  providing  an  ohmic  contact  for  electrical 

measurements  Waldrop  has  performed  LEED  studies  to  show  that  a  similar  cleaning 

procedure  provides  a  clean,  well  ordered  surface." 

After  the  surface  had  been  cleaned  and  XPS  performed,  very  pure  O2  was  leaked 

into  the  preparation  chamber,  typically  to  a  pressure  of  10  Ton  Details  of  the 
surface  oxidation  procedure  have  been  previously  reported  ^  Deep  1!V  irradiation 

of  the  GaAs  was  performed  with  a  low  power,  high  repetition  rate  Krl:  exetmer  laser 

(  A  ~  24K  nm  )  of  photon  energy  4.9  eV.  Typical  laser  power  used  is  c  2  mJ  cm^  so 
that  thermal  effects  are  negligible  The  GaAs  was  irradiated  at  100  11/  for  5  min 

to  20  min  to  produce  the  desired  oxide  thickness 

Oxide  thicknesses  were  measured  using  XPS  by  determining  the  ratio  of  areas  of 
the  oxidized  As  peak  to  the  As  peak  from  the  substrate/’  Oxide  thicknesses  between 
3  and  50  A  can  be  accurately  controlled  by  varying  the  Op  pressure  and  duration  of 

irradiation  A  3  A  oxide  can  be  reproducibly  formed  with  a  S  min  irradiation, 
while  a  15  A  oxide  will  form  in  20  min  of  irradiation 

The  oxides  used  in  this  study  are  between  3  and  10  A  thick  in  order  to  produce 


contacts  with  good  ideality  factors  The  oxides  grown  in  the  manner  described 
above  are  found  by  XPS  to  be  stoichiometric  with  respect  to  Ga  and  As.  consisting 
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predominantly  of  G*2®3  However,  we  have  also  formed  nonstoichiometric 

oxides.  Uiing  the  known  fact  that  Ga20j  is  more  thermally  stable  than  AS2O3,  we 

can  produce  a  Ga-oxide  rich  surface  by  annealing  the  oxidised  GaAs.  After  a 
itoichiometric  oxide  is  grown,  and  the  chamber  has  been  re-evacuated,  the  sample  is 
heated  to  500*C^  for  10  min  and  allowed  to  slowly  cool.  This  is  believed  to  cause 
a  reaction  where  AS2O3  reacts  with  the  substrate  to  form  Ga2C>3  while  As  desorbs 

from  the  surface. 

Typical  XPS  data  is  shown  in  Fig  I  comparing  a  surface  with  a  stoichiometric 

oxide  to  the  aarne  surface  after  annealing  The  As  2pj  2  P03*  indicates  that  the 

oxide  layer  is  6  A  thick  before  annealing  Notice  that  after  annealing  there  is 
very  little  signal  from  the  chemically  shifted  peak  due  to  oxidi/cd  As  Since  the 

Ga  peak  cannot  be  used  to  determine  the  thickness  of  Ihc  oxide  layer,  wc  used  the 

preannealed  oxide  thickness  as  a  measure  of  the  amount  of  oxygen  on  the  GaAs 
surface  before  metal  deposition.  The  Ga  peak  after  annealing  however,  is  still 
chemically  shifted  0.9  eV  from  the  substrate  peak  position,  characteristic  of 
oxidised  Ga  The  ratio  of  the  As  2p$/2  S'P13!  *0  the  Ga  2pyi  signal  for  the 
annealed  surface  is  046,  in  agreement  with  the  prediction  that  the  surface  is  Ga 
rich  The  ratio  for  a  stoichiometric  surface  has  been  found  from  MBl  prepared 
samples  to  be  1.4  Notice  in  Fig  1  that  the  O  Is  peak  is  unchanged  in  magnitude 
before  and  after  annealing,  consistent  with  our  assumption  that  the  amount  of 
oxygen  on  the  surface  does  not  change  during  annealing 

The  metals  used  to  form  contacls  in  this  study  were  Pd  Cu.  (  r.  and  Ti  They 

were  chosen  because  they  have  different  reactivities  with  GaAs  oxides,  they  cover  a 
range  of  metal  work  function  values,  and  all  have  been  found  to  rear:  strongly  with 
GaAs  surfaces  14*15.16  p<j  ^  Cu  jpj.  to  an  oxide  layer  on  GaV.  while  Ti  and 

Cr  react  completely  with  the  oxygen  in  thin  oxide  layers  1  TTir  contacts  were 
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formed  by  evaporation  through  ■  thin  Mo  mask  pulled  close  to  the  sample  surface 
just  prior  to  deposition  The  contacts  were  dots  of  8x10'-'  cm^  area  with  at  least 
■x  contacts  per  sample  After  metal  deposition,  the  sample  was  removed  from  the 

l)HV  system  and  the  contacts  were  electrically  characterized  by  dark  current- 
voltage  (l-V)  and  interna]  photoemission  (PE)  measurements 

III.  Results 

The  results  of  electrical  measurements  for  the  vanous  metal1-  or  the  different 
surface  oxides,  as  well  as  on  clean  control  samples,  ar.  summarized  in  table  1 
The  results  are  typical  values  from  measurements  o:  s<\eral  contacts  of  each 
category  The  optimum  oxide  thickness  to  cause  the  maximum  change  in  Schottk 

barrier  height  for  each  metal  was  not  investigated  It  can  be  seen  that  an  oxide 
layer  reduced  the  barrier  height  for  Cu,  Cr.  and  Ti,  but  increased  the  barrier  for 
Pd  It  will  be  discussed  below  that  this  correlates  with  the  work  functions  of  the 
various  metals  It  is  also  clear  that  the  contacts  with  Cia-nch  oxide  layers 
produced  a  larger  barrier  height  variation  for  the  various  metab  and  better 
ideality  factors  from  l-V  measurements  The  better  ideality  factor  is  especially 

noticeable  for  the  Cu  contacts  For  Cu  contacts,  similar  stoichiometric  oxide 
thickness  and  Ga-rich  oxide  thickness  result  in  a  much  better  ideality  factor  for 
the  Ga-rich  oxide  contact 

The  barrier  heights  in  table  1  measured  by  internal  photocmission  agree  with  the 
trends  observed  in  the  l-V  measurements  This  is  imponar'  since  PI  measurements 

are  the  mo*t  direci  measure  of  barrier  heights  ^  not  dcpcr.d.-ir'  on  contact  area, 
physical  constants,  or  low  bamcr  height  regions  In  som-  ease-  PI  measurements 
could  not  be  made  The  variation  in  PI  K  ■••nr  hcich'-  on  clean  surfaces  for 
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different  metals  it  only  0.15  eV,  while  on  Ga-rich  oxide  surface*  the  variation  is 
0.28  eV.  The  increase  in  barrier  height  variation  is  even  greater  from  I-V 

measurements  From  these  it  is  seen  to  vary  0.39  eV  for  Ga-rich  oxide  surfaces, 
compared  to  only  0.12  eV  for  contacts  on  clean  surfaces  This  corresponds  to  25% 
of  the  GaAs  bandgap,  compared  to  8%  for  the  dean  surfaces  examined  here. 

I-V  data  from  each  of  the  metals  and  surface  treatments  is  shown  in  Figs  2  and 

3.  For  the  metals  reactive  with  oxygen,  Cr  and  Ti,  the  shift  of  the  I-V  plot  is 

very  similar,  see  Fig  2.  The  stoichiometric  oxide  shift*  the  current 

significant!)  higher,  while  the  shift  for  a  Ga-rich  oxide  is  even  greater  These 
shift*  indicate  a  decrease  in  Schottk)  barrier  height  In  all  case*  the  plot*  arc 

parallel,  indicating  no  change  in  contact  ideality.  It  wa*  found  th.V  the  oxide 
thickness  had  less  of  an  effect  on  the  contact  ideality  factor  for  metah  reactive 

with  oxygen  than  for  metals  inert  to  the  oxide  layer  Thi*  could  be  of  practical 
importance  in  making  contacts  to  oxidized  surfaces  For  the  Cu  contact  Fig  3(a), 
a  stoichiometric  oxide  only  slightly  shifts  the  plot  to  higher  current,  indicating 
lower  barrier,  but  it  also  degrades  the  contact  ideality  Cu  contact  idcalitir* 
were  the  most  strong])  affected  by  the  stoichiometric  oxide  laser*  of  the  contact* 
in  this  stud)  For  a  Ga-rich  oxide,  however,  the  plot  shift-  to  that  for  a  much 

lower  barrier  with  good  ideality 

The  behavior  for  Pd  contacts  is  the  opposite  of  that  for  the  above  metals  Fig 
3(b)  shows  that  a  stoichiometric  oxide  shifts  the  1-V  plot  downward  corresponding 
to  a  higher  barrier  height  This  plot  indicates  no  degradation  of  contact 

ideality,  but  this  is  true  only  for  oxides  les*  than  7  A  thick  Thicker  oxide* 
increase  the  idealitv  factor  for  the  contact,  and  produce  lowci  apparent  barrier 
height*  Fig  3(b)  also  shows  a  further  increase  in  barrier  heigh'  for  Pil  on  a  Ga 


rich  oxide  lurface  with  the  ideality  factor  restored  to  the  original  value  While 
this  further  increase  in  barrier  height  is  alight,  H  was  consistently  observed 


IV.  Discussion 


The  large  increase  in  the  range  of  Schottky  barrier  heights  for  contacts  formed 
on  surfaces  which  have  been  oxidized  using  deep  UV  irradiation  at  room  temperature 
suggests  that  the  density  of  interface  states  has  been  reduced,  allowing  the  metal 
work  function  to  play  a  larger  role  in  the  barrier  formation  In  fig  4  the 

Schottky  barrier  height  for  the  various  contacts  have  been  plotted  versus  the  work 
function  of  the  contact  metal  Also  shown  are  the  barrier  height  expected  if  the 
Fermi  level  is  completely  pinned  and  the  ideal  barrier  height  limit  in  the  absence 
of  interface  states  in  all  cases,  the  change  in  bamcr  height  for  contacts  on 
oxidized  surfaces  is  toward  the  ideal  limit,  that  is.  in  the  direction  suggesting  a 
decrease  in  interface  state  density.  Again,  this  change  is  most  extreme  for  the 
Ga -rich  oxide  surfaces,  indicating  a  larger  decrease  in  interface  state  density  for 
this  case  Thi'  is  also  supported  by  the  better  ideality  factors  noted  above  for 

contact'  on  Ga-rich  oxide  surfaces  since  a  high  density  of  interface  states  will 
result  in  a  higher  ideality  factor  from  current-voltage  measurements 

A  GaAs  surface  with  a  Ga-rich  oxide  has  been  previously  found  by  XPS  to 
less  surface  band  bending  than  a  surface  with  a  stoichiometric  oxide  19  The 

reduced  band  bendtng  at  the  surface  was  attributed  to  a  reduction  in  the  interface 
slate  density  It  was  suggested  that  this  might  occur  when  the  stoichiometric 

oxide  was  annealed  and  G*20y  was  formed  in  the  reaction  with  the  substrate  In  our 
data  the  interface  state  density  app;  to  be  reduced  even  for  th;  stoichiometric 
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oxide,  but  reduced  further  f  -  it,;  Ga-rich  oxide 
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nonbonding,  occupied  orbital  associated  with  the  AS2O3  molecule  This  bonding 

defect  can  act  as  a  filled  acceptor  state  on  the  surface  The  reduction  of  the 
amount  of  AS2O3  on  the  surface  when  a  Ga-rich  oxide  is  formed  may  be  the  reason  for 
the  larger  variation  of  Schottky  barrier  heights  observed  for  contacts  to  annealed 


oxides. 


Our  results  are  also  in  agreement  with  the  recent  unpinning  of  the  GaAs  surface 
using  a  photochemical  process  in  flowing  water.  A  possible  explanation  for  the 
unpinning  was  a  passivating  Ga-oxidc  layer  on  the  surface. Our  stud\.  however, 
is  the  first  to  form  Schottky  contacts  on  a  well  characterized  Ga-oxidc  rich 
surface  where  an  increased  variation  in  Schottky  barrier  height  i'  demonstrated 

The  similsrity  of  behavior  for  metal'  reactive  with  the  surface  oxide  laver.  and 
metals  inert  to  the  surface  oxide  layer,  indicates  that  chemical  reactions 

involving  the  metal  and  the  oxygen  at  the  surface  arc  not  the  dominant  cause  for 
the  apparent  reduction  in  interface  states  It  is  not  clear,  however,  whether  the 
increased  Schottky  barrier  variation  observed  here  is  due  to  a  reduction  of  surface 
states  before  metal  deposition,  or  a  reduction  in  the  interface  state'  formed 
during  metal  deposition  Oxygen  at  the  surface  may  play  a  role  in  inhibiting  alloy 
formation  between  Ga,  As.  and  the  metal  overlayer. ^  Oxide  compound'  also  give  the 

semiconductor  surface  a  more  ionic  character  which  would  tend  to  decrease  the 

density  of  the  induced  states  when  a  metal  is  deposited  ^  ^  Ibi'  is  consistent 

with  the  observed  increased  reduction  in  interface  state'  fm  a  Ga-oxidc  rich 
interface  since  GajO^  ha'  a  more  ionic  character  than  AsjOy 
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V.  Conclusion 


A  much  wider  variation  in  Schottky  barrier  heights  for  contacts  on  GaAs  (100) 
surfaces  which  had  been  oxidized  using  low  intensity,  deep  l IV  illumination  at  room 
temperature  was  observed  compared  to  contacts  on  clean  GaAs  surfaces  This  has 

been  interpreted  as  resulting  from  a  reduction  of  interface  states  in  the  metal - 
GaAs  contact,  since  the  Schottky  barrier  which  results  for  metal'  deposited  on  the 
surfaces  with  oxides  varies  more  strongly  with  the  work  function  of  the  metal  than 
Schottks  barriers  on  clean  GaAs  surfaces  This  reduction  in  interface  states  15 
especially  pronounced  for  a  stoichiometric  oxide  which  has  been  annealed  to  form  a 
Ga-rich  oxide  surface 
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Figure  1 


Figure  2 


Figure  3 


Figure  4 


X-ray  Photoelectron  Spectroscopy  data  for  GaAs  surfaces  (a)  with  > 

stoichiometric  oxides,  and  (b)  the  same  surface  after  annealing  10  min  j 

•  at  500*C.  The  oxidized  As  signal  is  shifted  1o  lower  kinetic  energy  ( 

The  curves  labeled  (b)  show  a  Ga  oxide  rich  surface.  Note  the  O  Is  ! 

peak  has  the  same  magnitude  before  and  after  annealing  The  data  has  ! 

been  smoothed  and  the  background  has  been  subtracted  in  all  cases 


Natural  log  of  the  current  versus  applied  voltage  for  the  metal* 
reactive  with  a  GaAs-oxidc  layer  (a)  Ti  on  n-GaA*  with  the  indicated 
surface  treatments.  (b)  Cr  on  n-GaA*  with  the  indicated  surface 
treatment' 
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Natural  log  of  the  current  versus  applied  voltage  for  the  metal'  inert 

to  a  GaAs-oxide  layer.  (a)  Cu  on  n-GaA*  with  the  indicated  surface 
treatments  (h)  Pd  on  n-GaAs  with  the  indicated  surface  treatments 


Schottky  Barrier  Height  versus  metal  work  function  Also  plotted  is 
the  ideal  Schottky  limit  for  a  barrier  in  the  absence  of  interface 
states  (given  by  the  difference  between  the  metal  work  function  and 
the  electron  affinity  of  GaAs)  and  a  probable  pinned  limit  if  barrier 
formation  is  dominated  by  interface  states  Points  labeled  CL  are 
contacts  to  clean  surfaces  Points  labeled  SI  arc  contacts  to  I'Y 

grown,  stoichiometric  oxides  Points  labeled  GR  arc  contacts  to  Ga- 
rich  oxide  surfaces.  (  Metal  work  functions  from  ll.B  Michaelson,  J 
Appl  Phvs  48.4729  (1977).  Figure  after  C  R  (rowel!.  HR  Shore,  and 
E.F  laBate  J  Appl.  Phvs.  36.  3843  (I969|  » 
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Table  1. 


Metal 


Surface 

Preparation 


Oxide  PE  I-V 

Thickness  Barrier  Barrier 
(A)  (eV)  (eV) 


Ideality 

factor 


Pd 

Kef . 

Clean  . 

0 

.85 

.87 

.86 

1.03 

1.03 

Stolchioaetric 
Oxide  . 

3.0 

.92 

1.04 

Stoichiometric 
Oxide  . 

5.0 

.87 

.92 

1.04 

Ga-rich 

Oxide  . 

6.8 

.93 

.93 

1.04 

Cu 

Ref . 

Clean  . 

0 

.  92 
.88 

1.05 

1.05 

Stoichiometric 
Oxide  . 

4.2 

.84 

.84 

1.24 

Ga-rich 

Oxide  . 

4.2 

.65 

.77 

1.03 

Ti 

Ref . 

Clean  . 

0 

.74 

.  79 
.78 

1.03 

1.02 

Stoichiometric 
Oxide  . 

7.2 

.72 

.65 

1.02 

Ga-rich 

Oxide  . 

7.1 

.  70 

.63 

1.00 

Cr 

Ref . 

Clean  . 

0 

.  70 

.  73 
.  73 

1.04 

1.05 

Stoichiometric 
Oxide  . 

7.5 

.68 

.62 

1.08 

Ga-rich 

Oxide  . 

6.0 

. 

.54 

1.07 

Ref.  values  from  J.R.  Waldrop,  Appl . Phys . Lett .  46,1002(1984);  J.R. 
Waldrop,  J.  Vac.  Sci.  Technol.  B2,  445  (1984). 

Oxide  thickness  +-  10X  ;  1-V  and  PE  barriers  +  -  .0)  eV. 
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Poisson  phoions  may  be  convened  inlo  sub-Poisson  (amplitude-squeezed)  photons  by  the  action 
of  an  electron  current  configured  in  an  external  feedback  loop.  The  generation  mechanism  in¬ 
volves  single-photon  transitions  so  that  the  source  can  be  made  arbitrarily  sub-Poissoman  Non¬ 
linear  optics  is  noi  invoked  A  useful  configuration  involves  a  photon  emitter  illuminating  a 
detector-source  combination  in  a  closed-loop  system  Two  solid-state  implementations  of  the 
detector-source  combination  are  suggested 

PAC'S  numPcrv  42  50  -p,  72  80  F.y.  73  40Gk.  79  80  +w 


It  is  by  now  well  known  that  nondassica)  light  can  be 
generated  in  the  laboratory.  Such  light  may  exhibit  a 
variety  of  exotic  properties  including  sub-Poisson  pho¬ 
ton  statistics,1"  antibunching’  (sub-Poisson  photon 
statistics  and  antibunching  need  not  necessarily  accom¬ 
pany  each  other4),  and  squeezing'  Sub-Poisson  light 
is  also  called  amplitude-squeezed  light 
The  use  of  an  external  feedback  system6  for  the  gen¬ 
eration  of  sub-Poisson  light  was  first  suggested  by  ex¬ 
periments  in  which  feedback  was  used  to  produce 
sub-Poisson  electrons  Both  the  experiment  of 
Walker  and  Jakeman*  and  the  experiment  of  Machida 
and  Yamamoto"  involved  laser  (Poisson)  photons  il¬ 
luminating  .i  photodeiecinr  and  an  electronic  negative 
feedback  path  from  the  detector  to  the  source  In  the 
former  experiment  the  feedback  directly  controlled  the 
photons  at  the  output  of  the  laser,  whereas  in  the  latter 
experiment  the  feedback  controlled  the  current  at  the 
input  to  the  laser  Nevertheless,  the  principle  in¬ 
volved  in  ihe  two  experiments  is  ihe  same  6  Lntor- 
tunaiely.  these  simple  configurations  could  not  gen¬ 
erate  usable  sub-Poisson  photons  since  Ihe  feedback 
current  is  generated  from  the  annihilation  of  the  in- 
loop  phoions  However,  under  special  circumstances 
an  external  feedback  system  can  be  used  to  produce 
sub-Poisson  photons  such  as  when  correlated  photon 
pairs  arc  available10'1-’  or  a  quantum  nondemolition 
measurement  may  be  made  11  The  resultant  light  may 
only  be  weakly  sub-Poissoman  in  such  cases  because 
these  mulnphoton  processes  involve  a  series  of  detec¬ 
tions  and/or  a  weak  nonlinear  effect 

In  this  Letter  we  propose  a  new  approach  for  Ihe 
conversion  ol  Poisson  photons  into  sub-Poisson  pho 
tons  iiii  swale -photon  transitions  There  does  not  ap- 
(H'.ir  to  he  anv  lund.imcntal  limit  that  would  impede 


the  technique  from  being  used  to  produce  an  arbitrari¬ 
ly  intense  cw  light  source  that  is  also  arbitrarily  sub- 
Poissonian.  It  makes  use  of  the  action  of  an  electron 
current  configured  in  a  feedback  loop  Consider,  for 
example,  an  optical  system  in  which  a  photon  emitter 
illuminates  a  detector-source  combination,  in  a 
closed-loop  circuit  Two  alternative  configurations  are 
shown  in  Fig.  1.  The  character  of  the  photon  emitter 
is  immaterial,  we  have  chosen  it  to  be  a  light-emitting 
diode  (LED)  for  simplicity,  but  it  could  be  a  laser 6  In 
Fig  1(a)  the  photocurrent  derived  from  the  detection 
of  light  from  the  LED  photon  emitter  is  negatively  fed 
back  to  the  LED  input  It  has  been  established  both 
theoretically6  and  experimentally"  that,  in  the  absence 
of  the  block  labeled  "source,"  sub-Poisson  electrons 
will  flow  m  a  circuit  such  as  this  This  conclusion  is 
also  generally  valid  in  the  presence  of  this  block, 
which  simply  acts  as  an  added  impedance  to  the  elec¬ 
tron  flow  Incorporating  this  elemem  into  the  system 
critically  alters  its  character,  however,  since  it  permits 
the  sub-Poisson  electrons  flowing  in  the  circuit  to  be 
converted  into  sub-Poisson  phoions  by  means  of  elec¬ 
tron  transitions  The  key  to  the  achievement  of  this 
effect  is  the  replacement  of  the  detector  used  in  other 
feedback  configurations  with  a  structure  that  acts 
simultaneously  as  a  detector  and  a  source  The  elec¬ 
trons  simply  emit  sub-Poisson  photons  and  continue 
on  their  way  In  the  absence  of  the  feedback  path,  of 
course,  the  electrons  would  simply  emit  Poisson  pho¬ 
tons  Thus,  the  introduction  of  the  electron  feedback 
converts  Poisson  phoions  into  sub-Poisson  photons 
The  configuration  in  Ftg  Hbi  is  similar  except  that 
the  (negative)  feedback  current  modulates  (gates)  the 
light  intensity  at  the  output  of  the  LFD  rather  than  the 
current  ai  its  input 
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FIG  1  Generaiion  of  sub-Poisson  phoions  by  means  of 
negative  feedback  The  feedback  produces  sub-Poisson 
electrons  in  the  detector-source  which,  in  turn,  generate 
sub-Poisson  phoions  (a  >  Negative  feedback  modulating  the 
photon-emitter  input  current  ( b >  Negative  feedback  modu¬ 
lating  the  photon-emitter  output  light  Wavy  lines  represent 
photons:  solid  lines  represent  electron  current  - 1  signifies 
the  feedback  time  constant 


In  Fig  2  we  illustrate  two  possible  solid-state 
detector-source  configurations  The  basic  structure 
consists  of  a  reverse-biased  p*-i-n*  diode  where  the 
p*  and  n*  heavily  doped  regions  have  wider  band 
gaps  than  the  high-field,  ltght-absorbing  and  -emitting 
/  region  This  arrangement  ensures  both  high  quan¬ 
tum  efficiency  at  the  incident  photon  wavelength  (to 
which  the  p*  window  layer  is  transparent!  and  high 
collection  efficiency  (due  to  the  waveguide  geometry) 
for  the  light  generated  by  the  electrons  drifting  in  the  i 
layer  An  edge-emitting  geometry  is  therefore  ap¬ 
propriate  To  maximize  the  collection  efficiency, 
some  of  the  facets  of  the  device  could  be  reflectively 
coated  Two  light-generation  schemes  are  explicitly 
considered  here  single-photon  dipole  electronic  tran¬ 
sitions  between  the  energy  levels  of  the  quantum  wells 
iFtg  2 (a  > )  and  impact  excitation  of  electroluminescent 
centers  in  the  i  region  by  drifting  electrons  (Fig  2 < h » ] 
In  the  first  scheme  (Fig  2<a)l.  the  /  region  consists 
of  a  layer  in  which  the  incident  phoions  are  absorbed 
(detection  region)  and  an  adjacent  quantum-well  re¬ 
gion  in  which  photons  are  generated  by  sequential 
resonant  tunneling  (source  region)  Capasso.  Moham¬ 
med.  and  Cho14  have  recently  demonstrated  the 
sequential  resonant  tunneling  ol  electrons  through  a 
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FIG  2.  (a)  Band  diagram  of  ihe  quantum-well  detector- 
source  The  energy  of  the  incident  photon  emitted  by  the 
LED  is  denoted  The  absorbing  region  (detection  re¬ 
gion)  is  of  GaoailnosjAs  or  GaAs.  typically  1  n m  thick  In 
ihe  source  region,  the  wells  a^re  Garw'InooAs  or  GaAs  in  the 
thickness  range  150-300  A  The  barrier  layers  are  of 
Arsine $jAs  (in  the  case  of  Gac«-lno <y.As  wells)  or  AlAs  (in 
the  case  of  GaAs  wells'  and  mould  be  in  the  thickness  range 
20-50  A  to  achieve  tunneling  times  <  1  psec  Photons  of 
energy  Huy:  are  emitted  via  transitions  from  level  3  to  2 
The  p *  and  n~  wide-gap  regions  are  of  Al04ilnr  As  'or 
AlAs)  The  interface  between  the  and  n*  regions  is  com- 
positionally  graded  < b )  Energy -band  diagram  of  a  detector- 
source  with  electroluminescent  centers  tha:  arc  impact  even¬ 
ed  by  energetic  photoelectrons.  emiinng  photons  with  ener¬ 
gy  Hw 


quantum-well  superlattice  consisting  of  35  periods  of 
Al0  ajlrio  c2As  G40  A)  and  Ga,  j  - 1  rq,  <,  As  (140  A' 
The  observation  of  this  effect,  first  predicted  in  19"1 
by  Kazartnov  and  Suris.1'  has  been  made  possible  by 
the  remarkable  quality  of  superlattice  structures  re 
cently  achieved  by  the  molecular-beam-epitaxy  growth 
technique 

The  structure  presented  in  Fig  2(a)  is  similar  to  i hot 
previously  used  in  the  resonant  tunneling  experi- 
ment  14  The  reverse-bias  voltage  applied  to  the  detec  - 
tor  is  adjusted  such  that  the  potential-energy  drop 
across  the  superlattice  period  (barrier  plus  well'  is 
equal  to  the  energv  difference  between  the  bottom  ot 
the  third  and  first  subbands  ol  the  quantum  wells 
Such  discrete  subband'  (which  have  momentum 
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dispersion  in  the  plane  of  the  layer)  arise  from  size 
quantization  perpendicular  to  the  layers  when  the  elec¬ 
tron  de  Broglie  wavelength  is  comparable  to  the  well 
thickness.16  Under  such  bias  conditions  electrons  un¬ 
dergo  resonant  tunneling  into  the  third  energy  level  of 
the  quantum  wells  and  the  device  current  exhibits  a 
peak  due  to  the  enhanced  tunneling  probability.  Once 
the  electron  finds  itself  in  the  third  energy  level,  it  re¬ 
laxes  by  the  emission  of  either  phonons  or  infrared 
photons. 

The  relative  probability  of  these  two  processes  is  an 
important  parameter  of  the  system  because  a  nonradia- 
tive  decay  represents  random  deletion  and  this  reduces 
the  sub-Poisson  character  of  the  generated  light. 17 
The  most  important  nonradiative  process  is  scattering, 
by  intersubband  optical  phonons,  of  the  electron  locat¬ 
ed  at  the  bottom  of  the  third  energy  level.  This  results 
in  the  electron  finding  itself  in  the  n  =  2  subband  after 
the  emission  or  absorption  of  a  phonon  (Once  the 
electron  has  scattered  to  the  second  subband  it  nonra- 
diatively  relaxes  to  the  bottom  of  the  ground-state 
subband  )  The  intersubband  optical-phonon-assisted 
electron  transitions  are  much  stronger  than  the 
acoustic-phonon-assisted  transitions  and  have  a  rate 
which  is  typically  several  orders  of  magnitude  greater 
than  the  radiative  spontaneous  emission  rate  from  the 
n  =  3  to  the  n  =  2  level  (this  is  estimated  to  be 
=  10-10-  sec'  1  at  a  wavelength  *  =  10  /im,  in  the 
1 40- A  well  structures  considered  in  Ref  14)  This  ra¬ 
diative  transition  is  dipole  in  nature  and  the  emitted 
light  is  polarized  normal  to  the  plane  of  the  layers 
These  photons  cannot  be  reabsorbed  by  ground-state 
electrons  since  t.-,  -  E:*  £;  -  £,  Indeed,  this  dipole 
transition  has  recently  been  observed  in  absorption 
between  the  ground  and  first-excited  slates  of  quan¬ 
tum  wells  11 

T  ne  probability  that  an  electron  undergoes  a  radia¬ 
tive  transition  (i  e  .  the  radiative  efficiency »  r\,  at  the 
emission  frequent  t  £\  -  £'; )  h  can  nevertheless  be 
increased  substantially  by  quenching  of  the  optical- 
phonon-assisted  transition  This  may  be  accomplished 
by  an  appropriate  increase  of  the  well  thickness  there¬ 
by  making  the  energy  separation  £\-  E:  smaller  than 
the  optical-phonon  energy  t  =  .15  meV).  in  which  case 
the  device  would  emu  in  the  middle  infrared  wave¬ 
length  region  Another  method  to  increase  the  radia¬ 
tive  efficiency  is  to  apply  a  magnetic  field  perpendicu¬ 
lar  to  the  layers  In  the  presence  of  the  magnetic  field, 
the  electron  states  in  the  p'ane  of  the  layer  become 
quantized  resulting  in  a  manifold  of  discrete  Landau 
levels  This  total  quantization  of  the  electron  wave 
function  dramaticallv  reduces  the  available  density  of 
states  for  scattering  thus  quenching  the  mtersubbund 
transitions  Recent  experiment-  bv  R\un  o  ill  w  have 
indeed  shown  that  m  the  presence  ol  a  perpendicular 
magnetic  field  the  encrgc  relaxation  rate  ol  an 


electron-hole  plasma  in  quantum  wells  is  reduced  by 
many  orders  of  magnitude.  Although  it  is  difficult  to 
estimate  precisely  the  radiative  efficiency  obtaina¬ 
ble  with  the  selective  quenching  of  optical- 
phonon-assisted  intersubband  transitions  and/or  with 
the  application  of  an  external  magnetic  field,  we  ex¬ 
pect  that  values  between  0.01  and  0.1  should  be 
achievable.  We  have  applied  the  standard  four- 
level-system  rate  equations  to  estimate  that  1-10  nW 
of  infrared  power  at  a  wavelength  &  10  ^m  should  be 
achievable  for  a  photocurrent  =10" 5  A.  Another 
suggested  configuration,  making  use  of  electro¬ 
luminescent  centers  impact  excited  by  energetic  pho¬ 
toelectrons,20  is  presented  in  Fig.  2(b).  The  use  of  a 
superlattice  configuration  in  the  /  region  could  be  use¬ 
ful  in  the  enhancement  of  the  impact-excitation  proba¬ 
bility.21 

The  ability  of  configurations  such  as  these  to  gen¬ 
erate  sub-Poisson  light  requires  a  number  of  interrela¬ 
tions  among  various  characteristic  times  associated 
with  the  system.  Specifically,  the  condition  t  j 
»  rlr,  rresp  must  be  obeyed,  where  ~ j  is  the  feedback 
time  constant,  Tlr  is  the  transit  time  of  the  electrons 
and  holes  through  the  structure,  and  rtesp  is  the 
response  time  of  the  detector-source.  This  condition 
ensures  that  the  detector-source  response  is  fast 
enough  to  be  properly  integrated  by  the  feedback  cir¬ 
cuit.  It  is  also  required  that  T  »  r f.  where  T  is  the 
counting  (integration)  time  of  the  external  detector 
that  monitors  the  sub-Poisson  light.  This  ensures  that 
the  photons  are  monitored  for  a  time  that  is  sufficient¬ 
ly  long  for  the  negative  feedback  to  act  on  the  rate  of 
LED  photon  emissions.  Finally,  it  is  also  required  that 
T  »  where  r,  is  the  spontaneous-emission  life¬ 
time  of  the  radiative  transition  This  assures  that  the 
photon  emissions  will  be  captured  in  the  proper  count¬ 
ing  time  interval 2  22 

An  estimate  of  the  degree  to  which  a  light  source  is 
sub-Poisson  (amplitude  squeezed)  is  provided  by  the 
ratio  of  the  photon-number  variance  to  the  photon- 
number  mean  (Fano  factor),  F„(  7")  -  Var<  n  1/ <  n  > . 
For  sub-Poisson  light,  the  condition  0^  £„<  T)  <  1  is 
obeyed,  the  closer  F„IT)  is  to  zero,  the  more  sub- 
Poissoman  is  the  light  The  mechanism  for  the  gen¬ 
eration  of  sub-Poisson  light  described  in  this  Letter 
can  be  characterized  by  a  sub-Poisson  electron  count¬ 
ing  process  e.  each  event  of  which  independently  gen¬ 
erates  a  random  number  of  photons  M  in  the  source 
The  overall  photon-number  Fano  factor  £„(  T)  can 
then  be  represented  in  terms  of  the  Fano  factor  for  the 
electron  number  £,<  T )  and  the  Fano  factor  for  the 
source  random  variable  FM<7~)  The  relationship 
.<>'  22 

/„«  <  V  ID 

where  <  \f  i  is  the  average  number  of  photons  generat- 
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ed  in  the  source  by  each  electron. 

For  ihe  case  at  hand  it  is  reasonable  to  assume  that 
the  source  random  variable  is  Bernoulli  distributed  in 
each  stage  of  the  device, ::  with  the  probability  of  an 
electron  giving  rise  to  a  photon  denoted  17,.  No  gen¬ 
erality  is  lost  by  consideration  of  the  multilayer  super- 
lattice  case,  which  consists  of  m  independent  stages 
The  source  statistics  will  then  take  the  form  of  a  bino¬ 
mial  random  variable  with  and  Var(A/> 

In  the  presence  of  random  deletion 
arising  from  other  factors  le  g.,  finite  geometrical 
photon-collection  efficiency,  absorption,  external 
detection)  and  background  or  dark  photons,  these 
results  remain  valid  upon  the  replacement  of  tj,  by  the 
quantity  tj/J,  where  n  is  the  overall  quantum  efficiency 
from  electrons  to  detected  photons  and  fi  is  a  factor 
representing  the  admixture  of  independent  dark 
and/or  background  events  17  In  that  case,  /■„  will  be 
the  Fano  factor  for  the  detirieti  photons.  (It  may  be 
useful  to  operate  structures  such  as  those  discussed 
here  at  reduced  temperatures  to  assure  that  (i  =  I  and 
to  enhance  the  resonant  tunneling  current  )  Equation 
( I )  then  provides 

-  I  =  T)/i |  mhe  -  I ) .  (2) 

From  Eq.  (2)  it  is  evident  that  sub-Poisson  behavior 
may  be  discerned  when  h,  <  l/m.  However,  the 
lowest  Fano  factor  is  achieved  when  m  =  I  In  this 
case,  the  photon  counting  process  is  simply  a  randomly 
deleted  version  of  the  electron  counting  process  so 
that  Eq.  (2)  reduces  to  its  usual  familiar  form  2  I7  JJ 

Numerical  estimates  can  be  obtained  for  the  degree 
of  sub-Poisson  behavior  that  is  expected  to  be  observ¬ 
able  for  the  two  structures  discussed  here,  on  the  as¬ 
sumption  that  /J  —  1  For  the  supcrlatticc  device,  t j 
will  be  the  product  of  the  quantum  efficiency  of  the 
cvtcrno/dctecior  ( r/rf  ~  0.8  I.  the  geometrical  collection 
efficiency  of  the  emitted  sub-Poisson  photons  <rjf 
~  0  5  ) ,  and  the  radiative  ell  iciency  ( rj,  =  0  I  I  From 
the  experiment  ol  Machida  and  Yamamoto"  we  esti¬ 
mate  that  /-,  0  2  (which  is  principally  limited  by  the 

quantum  efficiency  of  the  detector  in  the  detector- 
source  combination)  Thus,  for  this  particular  super- 
lattice  structure,  Eq  (2)  provides  an  overall  Fano  (ac¬ 
tor  h„  0  968  lor  the  detected  suh- Poisson  photons 
The  estimated  Fano  factor  lor  the  electroluminescent 
structure  falls  in  the  same  range  These  estimates  pro¬ 
vide  a  significant  potential  improvemeni  over  the 
value  observed  in  the  space-chargc-linnied  Fr.inck- 
Hertz  experiment  ‘  Lower  values  ol  the  Fano  factor 
can  be  achieved  in  structures  that  exhibit  higher  radia¬ 


tive  efficiency.  As  indicated  earlier,  there  is  no  funda¬ 
mental  limit  that  impedes  this  scheme  from  being  used 
to  produce  an  arbitrarily  sub-Poissonian  cw  light 
source  of  arbitrarily  high  intensity. 

This  work  was  supported  in  part  by  the  Joint  Ser¬ 
vices  Electronics  Program.  It  is  a  pleasure  to  thank 
R.  Kazarinov  for  useful  discussions  and  J  Shah  for 
suggesting  the  use  of  a  magnetic  field  to  enhance  the 
radiative  efficiency. 
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Excess  Noise  Factors  for  Conventional  and 
Superlattice  Avalanche  Photodiodes  and 
Photomultiplier  Tubes 

MA.LVIN  C.  TEICH.  senior  member,  seee.  KUN1AKI  MATSUO,  member,  ieee,  and 
BAHAA  E.  A.  SALEH,  member,  ieee 


Abstract— Light  felling  on  a  photodetector  produces  an  output  cur¬ 
rent  that  fluctuates.  The  noise  in  this  signal  arises  from  two  sources: 
randomness  in  the  photon  arrivals  and  randomness  in  the  carrier  mul¬ 
tiplication  process  intrinsic  to  the  photodetector.  A  general  formula  is 
derived  for  the  variance  of  the  photodetector  output  current  in  terms 
of  parameters  characterizing  these  two  sources  of  randomness  (the 
photon-number  variance-to-mean  ratio  for  the  light  and  the  excess  noise 
factor  for  the  detector).  An  important  special  case  of  this  formula  il¬ 
lustrates  that  the  output-current  variance  is  directly  proportional  to 
the  detector  excess  noise  factor  when  the  number  of  photons  at  the 
input  to  the  detector  is  Poisson  distributed.  Explicit  expressions  for 
excess  noise  factors  are  provided  for  three  kinds  of  photodetectors:  the 
double-carrier  conventional  avalanche  photodiode,  the  double-carrier 
superlattice  avalanche  photodiode,  and  the  photomultiplier  tube.  The 
results  for  the  double-carrier  superlattice  device  are  new  ;  it  is  shown 
that  even  a  small  amount  of  residual  hole  ionization  can  lead  to  a  large 
excess  noise  factor.  Comparisons  are  drawn  among  the  detectors  in 
terms  of  their  noise  properties. 

I.  Introduction 

IGHT  falling  on  a  photodetector  produces  an  output 
current  that  fluctuates.  The  noise  in  this  signal  arises 
from  two  sources:  randomness  in  the  photon  arrival  num¬ 
ber  and  randomness  in  the  earner  multiplication  process 
intnnsic  to  the  photodetector.  The  object  of  this  paper  is 
fourfold.  First,  we  calculate  the  vanance  of  the  output 
current  in  terms  of  parameters  that  characterize  these  two 
sources  of  randomness  (the  Fano  factor  for  the  light  and 
the  excess  noise  factor  for  the  detector  multiplication  pro¬ 
cess).  Second,  we  demonstrate  that  in  the  usual  situation 
(Poisson  photon  arrivals),  a  simple  relation  between  the 
output-current  vanance  and  the  excess  noise  factor  of  the 
multiplication  ensues.  Third,  we  give  explicit  expressions 
for  the  excess  noise  factors  of  three  photodetectors  of  in¬ 
terest:  the  conventional  avalanche  photodiode,  the  super- 
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lattice  avalanche  photodiode,  and  the  photomultiplier 
tube.  Finally,  we  compare  and  contrast  the  noise  behavior 
of  these  three  photodetectors,  illustrating  their  relative 
merits. 

The  usual  method  for  experimentally  determining  the 
excess  noise  factor  for  an  APD  involves  a  measurement 
of  the  variance  of  the  output  current1  when  the  device  is 
illuminated  by  a  Poisson  stream  of  photons  [1],  Yet  the 
theoretically  calculated  excess  noise  factor  is  defined  in 
terms  of  the  normalized  second  moment  of  the  gain  ran¬ 
dom  variable  when  a  single  photocarrier  initiates  the  mul¬ 
tiplication  [2]-[4],  The  relationship  between  the  two 
quantities  is  generally  obtained  individually  for  each  APD 
[2]-[5).  Our  first  task  is  to  derive  a  general  formula  that 
relates  these  quantities  for  an  arbitrary  source  of  light  and 
for  an  arbitrary  detector  multiplication  process.  When  the 
number  of  photons  at  the  input  to  the  detector  is  Poisson, 
the  output-current  variance  turns  out  to  be  directly  pro¬ 
portional  to  the  excess  noise  factor. 

Explicit  formulas  for  the  excess  noise  factor  are  pre¬ 
sented  for  several  special  cases:  the  conventional  ava¬ 
lanche  photodiode  (CAPD),  the  superlattice  avalanche 
photodiode  (SAPD),  and  the  photomultiplier  tube  (PMT). 
The  results  for  the  CAPD  are  in  accord  with  the  expres¬ 
sions  obtained  earlier  by  McIntyre  12],  13),  whereas  the 
results  for  the  single-carrier  SAPD  agree  with  those  re¬ 
ported  by  Capasso  et  al  [5]  for  the  graded-gap  staircase 
APD.  The  expressions  for  the  double-carrier  SAPD  are 
new,  although  they  are  related  to  expressions  obtained  by 
van  Vliet  et  al.  [6).  The  formulas  for  the  PMT  were  de¬ 
rived  long  ago  by  Zworykin  et  al.  [7]  and  by  Shockley 
and  Pierce  [8].  The  noisiness  of  the  three  photodetectors 
is  compared  graphically. 

The  excess  noise  factor  is  a  useful  statistic  because  it 
represents,  in  a  compact  way,  the  lowest  order  statistical 
properties  of  the  gain  fluctuations  that  introduce  multipli¬ 
cative  noise.  However,  it  must  be  recognized  that,  aside 
from  photon  fluctuations,  the  excess  noise  factor  does  not 
provide  a  complete  statistical  description  of  the  electron 
current  While  it  is  useful  for  the  calculation  of  quantities 
such  as  the  conventional  signal-to-noisc  ratio  (SNR1  fot 

If  onl>  the  ac  or  'signal''  portion  of  the  current  is  considered,  the 
variance  is  equivalent  to  the  mean  ^lare  current 
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analog  detection,  it  is  inadequate  for  describing  the  per¬ 
formance  of  a  digital-signal  information  transmission  sys¬ 
tem  [9].  Instead,  measures  such  as  probability  of  detec¬ 
tion  and  probability  of  error  must  be  used  for  such 
systems.  These  latter  quantities  are  strongly  dependent  on 
the  tails  of  the  counting  distributions  (beyond  the  second 
moment)  and  therefore  require  a  more  complete  statistical 
description  of  the  electron  current  (e  g.,  the  counting  dis¬ 
tribution)  [10]-[12]. 

II.  Theory 

Consider  a  point  process  representing  the  primary  (pho¬ 
ton-generated)  carriers.  Let  the  number  of  these  carriers 
generated  within  the  time  interval  (0,  T\  be  described  by 
the  discrete  random  variable  a.  Each  of  these  primary  car¬ 
riers,  in  turn,  is  assumed  to  independently  produce  M 
daughter  carriers  ( M  is  the  discrete  gain  random  variable 
representing  the  carrier  multiplication).  The  total  number 
of  electrons  n  produced  at  the  output  of  the  device  is  the 
quantity  of  interest. 

If  a  and  M  are  statistically  independent,  which  it  is  safe 
to  assume,  then 

<n>  =  <M>  ( a >  (1) 


Var  (n)  =  {M )'  Var  (a)  +  (a)  Var  (M ).  (2) 

The  angular  brackets  <  ■  >  represent  the  ensemble  average 
and  Var  ( • )  represents  the  count  variance.  These  relation¬ 
ships  are  known  as  the  Burgess  variance  theorem  [13]- 
[15],  Dividing  (2)  by  (1)  provides 

=  <M>  (3) 

where  the  Fano  factors  5,  are  defined  as 

5j  =  Var  (j)/(j  >.  j  =  n.  a,  M.  (4) 

The  quantity  is  the  Fano  factor  for  the  photogenerated 
carriers.  For  many  photon-counting  distributions,  includ¬ 
ing  the  Poisson,  the  Fano  factor  is  invariant  to  random 
deletion  [14],  in  which  case  is  the  same  as  the  Fano 
factor  for  the  photons  incident  on  the  device  (5pholo„). 
Equation  (3)  is  are  then  independent  of  the  detector  quan¬ 
tum  efficiency  i? 

In  many  detectors,  the  processes  a  and  n  are  filtered 
(continuous)  versions  of  their  discrete  counterparts.  In  that 
case,  we  use  the  spectral  form  of  the  Burgess  variance 

theorem  [15] 

</_>  =  <M>  <L>  (5) 


S„  =  <M>:  Sa  +  2qUa)  Var(M)  (6) 

The  quantities  </„  >  and  </0  >  and  S„  and  Sa  represent  the 
mean  currents  and  power  spectral  densities,  respectively, 
for  the  n  and  a  processes,  q  is  the  electronic  charge.  Since 
the  primary  process  satisfies 

Var  (/„  =  (qlTy  Var  (a),  (7) 


the  relation  B  =  1/2 T  (where  B  is  the  bandwidth  of  the 
system)  can  be  used  to  obtain 

Sa  =  2 q(la)  SB,  (8a) 

thereby  a.lowing  (6)  to  be  simplified  to 

S„  =  2qUa)  KM)2  +  Var(M)].  (8b) 

The  excess  noise  factor  Fe  is  defined  as  the  normalized 
second  moment  of  the  gain  random  variable  for  a  single 
input  photocarrier,  i.e., 

Ft  =  <Af2>/<Af>2.  (9) 

Equation  (9)  can  be  used  in  conjunction  with  (8)  to  pro¬ 

vide 

S„  =  2 <?</*>  <Af>2  [5a  +  ( Fr  -  1)]  (10a) 


Var  (/„)  =  2 «?</*>  B (My  [5Fa  +  (Fr  -  1)],  (10b) 

which  is  the  desired  general  relationship.  Combining  (9) 
with  (2)  leads  to  the  discrete  analog  to  (10b),  which  may 
be  written  as 

Var  (n)  =  <a>  <A#>2  [3Fa  +  ( Fr  -  1)]  (10c) 

or  equivalently  as 

=  <M>  [T0  +  (Fr  -  1)].  (lOd) 

The  excess  noise  factor  can  also  be  expressed  in  terms  of 
the  mean  and  variance  of  the  gain  by 

Fr  =  1  +  [Var(Af)/(Af>:]  (II) 

For  deterministic  multiplication 

Var  (M)  =  0,  F,  =  1  (12) 

whence  the  name  “excess  noise  factor 
The  laboratory  measurement  of  an  experimental  excess 
noise  factor  <t>,  is  often  earned  out  by  determining  the  true 
ac  mean-square  current  at  the  output  of  the  APD  under 
study  and  the  true  ac  mean-square  current  at  the  output  of 
a  device  identical  in  all  respects  except  that  with  unity 
multiplication.  The  ratio  of  these  two  currents  (which  are. 
properly  speaking,  variances)  provides  the  expcnmental 
excess  noise  factor  <t>e  |1).  We  therefore  have 

<t>,  =  Var  (/J/Var  (/„)  =  SnB!SaB  (13) 

Using  (10a)  together  with  (8a)  leads  to 

0,  =  «M>2/3J  [Sa  +  (F,  -  D]  (14a) 

Actual  experimental  measurements  of  <t>,  are  invanably 
carried  out  with  a  source  of  radiation  that  generates  a 
Poisson  flow  of  photocarriers.  In  that  case.  3a  =  1  so  that 
(14a)  becomes 

<f >,  =  <M>:F,.  ( 14b) 

This  explicitly  demonstrates  the  proportionality  of  the  ex¬ 
perimentally  determined  and  theoretically  calculated  ex¬ 
cess  noise  factors  for  Poisson  light 

The  general  result  for  <t>,  given  in  (14a)  may  be  equiv- 
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alently  expressed  in  terms  of  the  mean  <M>  and  Fano 
factors  and  Using  (1 1),  we  obtain 

<*>,  =  <M>  «A#>  +  (15a) 

In  the  case  of  Poisson  photocarriers,  this  simplifies  to 

<t>'  =  <A/>  «A/>  +  Sw).  (15b) 

III.  Excess  Noise  Factor  for  the  Conventional 
APD 

An  expression  for  the  excess  noise  factor  F,  associated 
with  avalanching  in  a  uniformly  multiplying  p-n  junction 
was  first  derived  by  McIntyre  [2].  When  only  electrons 
are  injected  into  the  depletion  layer,  but  both  electrons 
and  holes  have  the  capability  to  impact  ionize,  the  result 
can  be  written  as 

F,  (electron  injection) 

=  \kc{M)  +  (2  -  1/<A*»0  -  Ml;  (16a) 
when  only  holes  are  injected,  it  becomes 
F,  (hole  injection) 

=  -  (2  -  l/<Af »(1  -  *c)/*c].  06b) 

Equations  (16a)  and  (16b)  are,  as  indicated  above,  valid 
for  single-carrier-initiated/double-carrier  multiplication 
(SCIDCM).  The  quantity 

k ,  =  (17) 

in  (16a)  and  (16b)  represents  the  ratio  of  hole-ionization 
probability  per  unu  length  0  to  electron-ionization  prob¬ 
ability  per  unit  length  a  This  ratio  is  assumed  to  be  in¬ 
dependent  of  the  electric  field  £  and  constant  throughout 
the  avalanche  region. :  Equation  (I6bi  may  be  obtained 
directly  from  (16a)  by  using  the  substitution  it,  —  Mkr.  if 
electrons  and  holes  are  both  injected,  the  overall  result  is 
obtained  by  adding  the  two  partial  results  '  The  lower  the 
value  of  it, .  the  lower  the  device  noise  In  experimentally 
determining  the  APD  excess  noise  factor,  the  quantity  that 
is  directly  measured  is  the  variance  of  the  output  current 
in  response  to  a  Poisson  stream  of  photons  at  the  input, 
as  specified  m  (10b)  with  =  1  Using  ( 16a)  and  ( 16b) 
in  ( l Ob i  provides  results  that  accord  with  those  derived  by 
McInty  re  [2] 

The  average  multiplication  (mean  gain)  <Af>  for  a 
CAPD  with  pure  electron  injection,  expressed  as  a  funs 
non  of  the  distance  from  the  edge  of  the  depletion  laser 
*.  is  I2)-|4) 

<  M  )  =  ( 1  -  k,  )/{exp  |a(i,  -  1  )x]  -  t, } .  k,  *  S 

(18) 

‘Mclnorr  |?|  demonsiriltd  ihil  even  if  £)  is  noi  proportion*)  10  a.  > 
juiuble  vtlue  for  A,  called  A,*.  can  be  defined  for  F ,  if  (  M  >  iv  large 
Furthermore,  additional  none  u  introduced  when  light  is  absorbed  on  both 
tides  of  the  junction,  so  that  both  electrons  and  holes  are  injected  into  the 
avalanche  region  (this  is  double  earner  initiated  double  earner  multipli 
cation  or  DCJDCM)  In  that  case  an  effective  escesv  noise  factor  Ff«  can 
be  defined  (4J  McIntyre  s  theoretical  results  for  Si  APD  s  were  espen 
mentally  verified  by  Con  rad  i  |lj 


The  performance  of  digital-communication  systems  in¬ 
corporating  CAPD’s  was  examined  in  detail  by  Person- 
ick.  His  initial  treatment  of  this  topic  (10)  dealt  with  mul¬ 
tiplication  involving  a  single  ionizing  carrier,  as  well  as 
multiplication  involving  two  carriers  with  equal  ioniza¬ 
tion  coefficients.  In  a  subsequent  generalization  [11],  he 
obtained  upper-bound  performance  results  for  double-car¬ 
rier  devices  with  unequal  ionization  coefficients.  Person- 
ick’s  results  are  consistent  with  those  obtained  by  Mc¬ 
Intyre  [3]. 

An  important  special  case  is  that  of  the  CAPD  un¬ 
der  single-carrier-initiated/single-camer  multiplication 
(SCISCM)  conditions.  This  provides  the  lowest  possible 
noise.  Setting  kc  =  0  in  (16a)  or  kc  =  oo  in  (16b)  leads  to 

F,  =  2  -  \KM)  (19) 

which,  with  the  help  of  (10b)  with  5a  =  1 ,  gives  rise  to 

Var  (/„)  =  2qUa)  B(M)  (2<M>  —  1].  (20) 

The  average  multiplication,  readily  obtained  from  (18),  is 
then 

<M>  =  exp  (ctr).  (21) 

Inserting  (21)  into  (20)  gives  the  expression  for  the  vari¬ 
ance  of  a  filtered  Yule-Furry  birth  process  with  j  Poisson 
initial  population  [16].  The  identity  between  the  statistics 
of  the  SCISCM  CAPD  and  the  Yule-Funy  process  is  con¬ 
firmed  by  the  gam  distribution  (in  response  to  a  single 
initiating  event);  it  is  the  shifted  Bose-Einstein  distribu¬ 
tion  in  both  cases  [  10],  [12],  [16]. 

In  the  case  where  the  ionization  coefficients  for  elec¬ 
trons  and  holes  are  equal  (a  =  0.  k,  =  I ).  ( 16a>  and  (16b) 
become 

Fr  =  (M  > .  (22) 

whereupon  (10b).  with  =  1.  provides 

Var  </„)  =  2qUu)  B(M)\  (23) 

as  first  obtained  by  Tager  [17]  The  average  multiplica¬ 
tion  is  then 

<M>  =  1(1  -  an.  k  =  1  (24) 

For  a  device  operated  wuh  either  pure-electron  or  pure- 
hole  delta-function  injection  lie.  pure  injection  at  one  or 
the  other  edge  of  the  depletion  layer),  this  is  the  noisiest 
situation  However,  for  a  *  0.  F,  can  be  even  greater 
than  or  less  than  <M>.  depending  on  where  the  light  is 
absorbed  in  the  junction  ' 

IV  Excess  Noisf  Factor  for  thf  Superlatticf 
APD 

An  illustrative  example  of  a  superlauice  APD  (SAPD) 
is  the  staircase  avalanche  photodiode  It  is  a  graded-gap 
multilayer  device  proposed  by  Williams.  Capasso.  and 
Tsang  [5],  (18).  [19]  for  low -noise  iighi  detection  in  the 
near-infrared  region  of  the  spectrum  The  device  is  of  in 
terest  for  fiber-optic  communications  It  is  designed  to 
achieve  an  enhancement  of  the  impact-ionization  proba 
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bility  ratio,  thereby  minimizing  the  hole-electron  feed¬ 
back  noise  associated  with  conventional  III— V  quaternary 
APD’s  for  which  kc  =  1  [20].  Furthermore,  because  the 
electron  multiplication  can  occur  only  at  a  finite  number 
of  discrete  locations  in  the  device,  the  variability  of  the 
number  of  electrons  generated  per  detected  photon  is  also 
reduced  relative  to  the  CAPD.  (It  has  already  been  ex¬ 
perimentally  shown  that  the  first  superlattice  APD  struc¬ 
ture,  proposed  by  Chin  et  al..  can  provide  an  enhanced 
ionization  ratio  [21)-(25].)  The  fabrication  of  a  staircase 
device  in  this  region  of  the  spectrum,  using  molecular- 
beam  epitaxy,  is  currently  underway  at  AT&T  Bell  Lab¬ 
oratories  [24], 

Although  we  deal  with  the  graded-gap  staircase  SAPD 
extensively  for  purposes  of  illustration,  the  analysis  pre- 


their  model  a  good  starting  point  for  describing  the  excess 
noise  factor  for  the  double -carrier  instantaneous-multipli¬ 
cation  SAPD.  Although  the  theory  is  appropriate  as  it 
stands  for  those  SAPD’s  in  which  both  electron  and  hole 
ionizations  occur  at  discrete  locations  (e  g.,  the  multi- 
quantum-well  SAPD  [23]).  it  must  be  modified  for  the 
staircase  SAPD.  In  this  latter  case,  we  must  incorporate 
a  proper  continuous  theory  for  the  hole-ionization  proba¬ 
bility  [5]  into  the  discrete  shifted-Bemoulli  theory  of  van 
Vliet  et  al. 

The  excess  noise  factor  F ,  for  the  double-carrier  SAPD 
with  electron  injection  may  then  be  obtained  from  the 
expression  for  the  variance  derived  by  van  Vliet  et  al.  [6. 
eq  (73)]  using  the  notational  replacemenis  —  < M ). 
X  -»  P.  and  k  —  kt  along  with  (11)  This  yields 


F,  (electron  injection)  =  1  + 


1'<M))(1  -  k, ) 
-  Pil  •>-*,) 


<M>  k. 


sented  here  is  applicable  for  any  SAPD  in  which  the  ear¬ 
ner  transport  is  perpendicular  to  the  superlattice  planes 
In  such  structures,  the  earners  encounter  a  potential  dis¬ 
continuity  at  the  heterointertaces  at  each  penod  of  the 
multilayer  structure  Thus,  our  results  will  also  appls  to 
the  multiquantum-well  SAPD  structure  o!  Chin  <•:  al 
1 2  1  ] - [ 2 3 ] .  1 2 5 ] .  the  doped-quanturr  well  SAPD  structure 
of  Blauselt  <(  al  12b]  and  the  stored  earner  multiquan 
tum-wel!  SAPD  [2 ~ | -  [ 2^ f  Howeser  the  results  will  not 
apply  to  the  channeling  APD  1 30]  pi]  nor  to  other  de 
\  ices  in  w  hich  the  tamers  are  spatialls  separated  bs  means 
of  a  trjnss  erse  he  id  w  it),  transport  taxing  pi  j.  e  i  n  the  plant 
ot  'he  lasers 

The  gain,  es.e"  noise  ta.tor  and  elcsttor.  probahilits 
distribution  at  the  output  ot  .  stair, a'e  SAPD  have  re 
cently  been  calculated  a«  a  function  ot  the  number  oi 
stages  of  the  device  fn  arid  the  electron  impa-t  ionization 
probability  per  stage  P  under  the  SCISCM  assumption 
[32!  This  analssis  sahd  no)  onls  tor  (he  ctaiKu'e 
SAPD  but  tor  ans  ot  the  perpendu  ular  ,  i-ner  transport 
SAPD  s 

We  now  extend  these  results  to  allow  tor  residual  hole 
initiated  ionization  in  the  graded  regions  ot  the  device 
arising  from  the  applied  electric  held  'The  valence  band 
steps  arc  ot  the  wrong  sign  to  assist  hole  initialed  torn 
zation  indeed  thes  mas  lead  to  hole  trapping  Also,  be 
cause  ot  the  opposing  sondustion  band  quasi  electric 
held  the  electrons  .an  impact  ionize  on's  a'  the  soridu, 
tion-band  dis,  ontinuities  1  I  he  mathematical  results  toi 
low  from  the  treatment  presided  bs  san  A  lu.;  c:  ai  |b| 
with  appropriate  extension  and  reinterpretation  The 
model  pros  ided  bs  these  authors  was  intended  to  de'e  nbe 
mulliplnation  noise  in  C  APD  s  Howeser  their  use  id  a 
deterministic  number  ot  -bitted  Bernoulli  staves  makes 


T  *»f  "'-pv-.sf  r r 
f fffv ia  c*  'n 


,  *.p  .s  *u  -n  p-  hfo 


As  previously.  P  is  the  electron  impact-ionization  proba¬ 
bility  per  stage  and  <Af>  is  the  average  overall  multipli¬ 
cation  for  the  device  The  ratio  of  the  hole-ionization 
probability  per  stage  Q  to  the  electron-ionization  proba- 
bil its  per  stage  P  defines  k ,  tor  the  SAPD.  i  e  . 

k.  =  Q  P  (26a l 

Por  the  staircase  SAPD  Q  is  gisen  by  |5] 

>/ 

Q  =  exp  (  |  J  d: )  -  1  (26bi 

where  o  the  hole  ionization  coefficient  (probability  per 
urn  lergthi  in  the  graded  'egion  and  /-  is  the  length  of 
eadi  stage 

The  average  multiplication  <  M  >  is  obtained  with  the 
help  ot  |b.  eq  i53i]  and  (26a1  which,  together  with  the 
notational  substitutions  X  -*  P.  *i  -•  Q.  \  -»  m.  yield 


Prom  1 2"  i.  n  is  apparent  that  the  expression  for  the  excess 
noise  factor  w  r  2 .*>  r  could  readily  be  expressed  as  a  func¬ 
tion  ot  P.  k,.  ana  m  instead  of  P.  k..  and  <Af  )  When  k, 
*  U.  the  average  multiplication  in  1 2 “7 1  will  increase  wnh- 
■  >u:  limit  tor  certain  parameter  values  tie  .  avalanche 
breakdown  will  occur)  The  validity  of  (27)  is  restricted 
to  the  parameter  space  below  avalanche  breakdown 
The  rmsdihed  excess  noise  factor  {F,  -  1)  for  the 
SCI  DC  M  SAPD  is  plotted  as  a  function  of  the  average 
multiplication  \M)  with  the  help  of  1 2 5  I  and  (27)  The 
modified  excess  noise  factor  is  used  because  it  is  conve¬ 
niently  displased  on  double-loganihmic  coordinates  Fur¬ 
thermore  j-.  in  evident  from  Section  II.  it  is  the  pertinent 
measure  in  the  absence  ot  quantum  fluctuations  In  Figs 
I  and  2  we  illustrate  its  behavior  for  m  =  3  and  10.  re¬ 
spectively  Results  for  several  values  of  k  are  presented 
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Fig  1  Modified  excess  noise  factor  F,  -  1  versus  average  multiplication 
<M)  for  the  single-camer-initiated'double-camcr  multiplication 
(SC1DCM)  SAPD  The  modified  excess  noise  factor  is  plotted  because 
it  can  be  conveniently  displayed  on  double-loganlhmic  coordinates.  It  is 
the  pertinent  measure  in  the  absence  of  quantum  fluctuations  The  num¬ 
ber  of  stages  m  -  3  The  behavior  for  different  values  of  4 .  is  illustrated 
paramctncall)  The  curve  for  4-0  corresponds  to  the  single-carrier- 
inttiated'single-camer  multiplication  (SCISCM)  SAPD  It  is  apparent 
that  even  small  deviations  of  k,  from  0  result  in  substantial  excess  noise 


Fig  2  Modified  excess  noise  factor  F ,  -  1  versus  average  multiplication 
<M>  for  the  SC1DCM  SAPD  with  m  =  10  Again,  the  curve  kt  =  0 
corresponds  to  the  SCISCM  SAPD  Comparison  to  Fig  I  demonstrates 
that  as  m  increases,  the  deleterious  effects  of  residual  hole  ionization 
become  more  pronounced 

parametrically.  The  k,  =  0  curve  corresponds  to  the 
SCISCM  SAPD.  As  for  the  CAPD.  the  lower  the  value 
of  ks,  the  lower  the  noise.  It  is  apparent  that  even  small 
deviations  of  k,  from  0  result  in  substantial  excess  noise. 
This  effect  is  more  pronounced  as  m  increases.  These  re¬ 
sults  provide  limits  on  the  residual  hole  ionization  that  is 
tolerable  in  an  SAPD.  Based  on  a  many-panicle  Monte 
Carlo  simulation,  Brennan  (33)  has  recently  estimated  ks 
to  be  «  10' 1  for  the  multiquantum-well  SAPD  of  Chin  et 
al.  (23]  and  «  10~J  for  the  staircase  SAPD  of  Williams 
et  al  [18].  in  the  GaAs/AIGaAs  system.  He  has  also 
shown  that  even  lower  values  of  ks  may  be  achievable  by 
using  the  doped-quantum-well  SAPD  structure  of  Blau- 
velt  et  al.  [26]. 


The  most  important  special  case,  of  course,  is  that  of 
the  SCISCM  device,  which  is  the  lowest  noise  SAPD. 
From  (27),  the  average  multiplication  for  the  single-car¬ 
rier  multiplication  SAPD  is  easily  shown  to  be 

(Af  >  =  (1  +  P)m.  (28) 

The  excess  noise  factor  is  obtained  by  setting  ks  =  0  in 
(25),  which  leads  to 

Ft  =  I  +  [(1  -  P)l(  1  +  />)][!  -  (1  +  P)'m)  (29a) 

=  (A#)'1  +  2{M)-'ln  -  2<A#>~l~l/m.  (29b) 

Using  (14b),  the  experimental  excess  noise  factor  d>,  will 
then  be  given  by 

<*>,  =  (1  +  P)2m  +  |(1  -  />)/(  1  +  P)][(l  +  P)2m 

-  (1  +  P)m]  (30) 

which,  with  the  help  of  (10b)  and  T0  =  1,  corresponds  to 
the  output  current  variance 

Var  (/„)  =  2  <?</„>  B  {(1  +  P)2/n  +  ((1  -  /*)/(  1  +  />)] 

■  [(1  +  P)2™  -  (1  +  P)m]}.  (31) 

Equation  (29a)  agrees  with  the  formula  obtained  by  Ca- 
passo  et  al.  [5.  eq.  (2)].  Equation  (31)  is  also  in  accord 
with  their  result  [5.  eq.  (1)].  provided  that  the  quantity 
<i:>  in  (5.  eq.  (1)]  is  interpreted  as  the  power  spectral 
density  S„.  All  of  the  formulas  presented  here  are  in 
agreement  with  those  reported  in  [32].  Equation  (29b)  is 
displayed  in  Figs.  1  and  2  (Jt,  =  0). 

Carrying  (29b)  to  the  limit  m  -*  oo  leads  to  the  result 

Fe  =  2  -  l/<AO,  (32) 

which  is  identical  to  that  given  for  the  SCISCM  CAPD  in 
(19).  This  is  as  expected;  in  this  limit,  there  is  an  infinite 
number  of  stages  and  the  probability  is  vanishingly  small 
that  a  carrier  is  produced  by  impact  ionization  in  any  one 
given  stage  of  the  device. 

Finally,  we  consider  the  case  of  equal  ionization  coef¬ 
ficients  for  electrons  and  holes  in  the  SAPD  (P  =  Q.  ks 
=  1).  The  excess  noise  factor,  output-current  variance. 


and  average  multiplication  then  become 

Fe  =  <A/>  -  «M>  -  1  ):lm{M)  (33) 

Var  (/„)  =  2 q{la)  B[{M)-  -  <M>«M>  -  l)2/m] 

(34) 

and 

<M>  =  1/(1  -  Pm),  (35) 

respectively.  Carrying  (33)  to  the  limit  m  —  oo  provides 
F,  =  {M  >  (36) 


which  is  identical  to  the  SCIDCM  CAPD  result  given  in 
(22)  for  the  same  reasons  as  indicated  above. 
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F ig  3  Modified  excess  noise  factor  f  -  I  versus  average  multiplication 
(M)  for  the  SC ! SC M  SAPD  (solid  curves)  with  »■  =  ^  and  10.  the 
SC1SCM  CAPD  (dotted  curve),  and  the  high  gain  GaP  hrst  dynode  PMT 
with  Poisson  multiplication  and  A  =  10  (dashed  curves)  for  m  »  1,4. 
10  This  figure  illustrates  the  best  possible  behav  tor  for  all  three  devices 
Although  the  theoretical  excess  noise  factor  of  the  SAP!)  is  always  su 
pcrior  to  that  of  the  CAPD  the  differentia!  cannot  be  large  because  h, 
<  2  tor  both  dev  ices  The  PMT  exhibits  high  gam  km  noise  behavior 
for  all  useful  values  of  <  Af  )  In  terms  of  the  cxu's%  noise  factor,  it  can 
exhibit  betier  performance  than  the  SAPD 
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Fig  4  Modihcd  excess  noise  factor  F,  -  I  versus  average  multiplication 
<  M  >  for  the  SCIDCM  SAPD  (A,  -  I .  solid  curves)  with  m  =  5  and  10. 
the  SCIDCM  CAPD  (A,  =  I.  dotted  curve),  the  identical-dynode  PMT 


behavior  of  both  devices .  The  excess  noise  factors  for  both 
the  SAPD  and  the  CAPD  then  always  lie  below  2.  as  is 
apparent  from  (19)  and  (29a).  Although  the  theoretical 
performance  of  the  SAPD  is  always  superior  to  that  of  the 
CAPD,  the  differential  (in  terms  of  excess  noise  factor) 
cannot  be  very  large  since  F,  <  2  for  both  cases. 

In  the  opposite  limit  (SCIDCM  with  equal  impact-ion¬ 
ization  probabilities),  the  results  are  displayed  in  Fig.  4. 
This  is  the  worst  possible  behavior  for  both  devices,  as¬ 
suming  single-carrier  (delta-function)  injection.  F,  for  the 
CAPD  is  identically  (M  >  from  (22).  F,  for  the  SAPD 
also  increases  in  approximate  proportion  to  <M  > ,  but  with 
the  coefficient  (1  -  1/m),  which  is  slightly  below  unity, 
as  can  be  discerned  from  (33).  Thus,  the  SAPD  excess- 
noise  curves  lie  slightly  below  the  CAPD  curve  for  finite 
m,  coming  ever  closer  as  m  -*  oo.  m  accord  with  (36) 
The  results  presented  in  Figs.  3  and  4  illustrate  explic¬ 
itly  that  the  feedback  noise  introduced  by  the  process  of 
double-earner  multiplication  is  far  more  deleterious  to 
noise  performance  than  is  the  randomness  associated  w  ith 
uncertainty  in  the  locations  of  the  ionizations 

VI.  Excess  Noise  Factor  for  the  Photomultiplier 
Tube 

The  photomultiplier  tube  (PMT)  is  one  of  the  oldest  and 
most  versatile  of  light  detectors,  having  been  developed 
about  1935  [7]  From  a  noise  point  of  view,  it  has  the 
distinct  advantage  of  being  a  single-carrier  device 
(SCISCM)  since  the  electrons  travel  in  vacuum  An  ex¬ 
cellent  description  of  essentially  all  aspects  of  PMT  op¬ 
eration  is  available  in  the  RCA  Photomultiplier  Handbook 
(34);  in  particular,  the  reader  is  directed  to  Appendix  G 
(pp.  160-176)  for  a  comprehensive  discussion  of  the  sta¬ 
tistical  theory  of  PMT  noise 
The  average  multiplication  for  an  m-stage  device  is 
given  by  [ 34 .  eq  (G-55)] 

m 

<M)  =  II  <6t  >  (37) 

A  =  I 

where  <6t>  is  the  mean  secondary-emission  gain  at  the 
k th  stage.  The  random  variable  6*  represents  the  kth-stage 


with  Bose-Emstein  multiplication  (shon-dash  curves)  and  with  m  -  4 
and  10  and  the  high  gain  GaP-first  -dynode  10-stage  PMT  with  Bose- 
Einxtein  multiplication  and  with  A  =  10  (long-dash  cuncl  This  figure 
illustrates  the  worst  possible  behavior  for  all  three  devices,  assuming 
single  carrier  (delta-function)  injection  For  the  CAPD.  Ff  is  identically 
equal  to  <Af>.  whereas  for  the  SAPD.  F,  increases  approximately  as 
<  M  >  These  results  illustrate  explicitly  that  the  feedback  associated  with 
double-carrier  multiplication  in  APD’s  gives  rise  to  far  more  noise  than 
does  uncertainty  in  the  ionization  locations  The  quintessential^  single 
earner  PMT  display  s  far  and  away  the  lowest  values  of  the  excess  noise 
factor 

V.  Comparison  of  Excess  Noise  Factors  for  the 
SAPD  and  the  CAPD 

The  modified  excess  noise  factors  for  the  SAPD  (solid 
curves)  and  CAPD  (dotted  curves)  are  plotted  versus  the 
average  multiplication  <M>  in  Figs.  3  and  4  for  ks  =  k , 
=  0  and  k,  =  k,  =  1,  respectively.  The  results  in  Fig.  3 
are  for  SCISCM  conditions,  representing  the  best  possible 


secondary-emission  gain  Using  an  expression  for  the  vari¬ 
ance  of  the  multiplication  M  at  the  output  of  the  PMT  |34. 
eq.  (G-56)]  together  with  (11)  gives  the  excess  noise  fac- 


F,  =  1  + 


Var  (M  Var  (62 ) 

<6.  >2  +<a,><6;>: 


Var  (6m) 


<6i>  <*:> 


>  <t>mr 


This  expression  is  of  general  validity  for  single-carrier 
discrete  multiplication  processes.  (Thus,  it  can  also  be 
used  for  the  SCISCM  SAPD  when  the  individual  stages 
have  different  values  of  P .)  It,  and  variants  of  it.  were 
obtained  early  on  [7],  [8],  (35) .  It  is  apparent  from  the 
sequence  of  denominators  in  (38)  that  the  gain  of  the  first 
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stage  <6,  >  has  a  substantial  influence  on  Ft\  the  higher 
this  gain,  the  lower  the  contribution  to  the  excess  noise 
from  the  subsequent  stages.  It  is  this  mathematical  prop¬ 
erty  that  spurred  the  development  of  high-gain  GaP-first- 
dynode  PMT's  at  RCA  [36]. 

Two  useful  special  cases  of  (37)  and  (38)  involve 
PMT's  with  identical  dvnodes  and  PMT's  with  a  high- 
gain  first  dynode.  We  first  assume  that  all  m  dynodes  of 
the  device  are  equivalent,  so  that  the  6k  are  independent 
and  identically  distributed  (iid)  random  variables  for  all 
k .  with  mean  and  variance  given  by  <6>  and  Var  (6),  re¬ 
spectively.  Then  (37 1  becomes 

<A/>  =  <6>m  (39) 

whereas  (38)  becomes 

Var  (6i  1  1 

F,  =  1  *  — - ; - -  1  -  — - (40) 

(o)  i so)  -  1 1 .  <6>  j 


These  expressions  are  exact  and  applicable  to  an\  single- 
earner  dev  ice  with  m  identical  stages  Thus,  the  formula 
for  the  SC1SCM  SAPD  excess  noise  factor  given  in  (29a) 
may  be  obtained  by  setting  <A>  =  1  ~  P  and  Var  (6)  = 
Pd  -  Pi 

We  next  assume  that  all  stjgcs  produce  i id  secondary 
electrons,  with  mean  and  variance  and  Var  (6)  re¬ 
spectively.  imp  tor  the  first  stage  tor  whish 


<n  / 


4  «  .• 


(4  la l 


and 


Var  in  i  =  A  \  ar  < <S i  (41b) 

where  A  is  a  constant  When  4  »  1 .  tho  characterizes 
the  high-gain  first-dynodc  PM  I  ie  g  .  (iaPi  The  mean 
gain  (37)  then  become' 

<A/>  =  A  <A>''  (41c) 

whereas  the  excess  noise  factor  (38>  becomes 
Var  (6)  1  i 

F,  =  i  -  — - — - -  1 - —  .  .  (42) 

.4  <6)  (<o>  -  1)  <A> 

The  similarity  between  (42 1  and  (40)  is  obvious,  with  the 
prefactor  I  d  on  the  right-hand  side  of  (42 1  succinctly 
representing  the  excess-noise-factor  advantage  of  the 
high-gain  first-dynode  tube 

To  proceed  further,  the  variance  of  the  gain  at  the  var¬ 
ious  stages  must  be  specified.  This  is  determined  by  the 
secondary -emission  process  at  the  dy  nodes.  The  simplest 
and  most  frequently  used  model  invokes  Poisson  second¬ 
ary-emission  multiplication  at  every  stage  (Var  (6)  =  < 6 >  > 
[7],  [8],  [35],  1 37] - ( 39] .  A  more  versatile  model  is  pro¬ 
vided  by  the  neagative-binomial  (or  Polya)  distribution, 

[  which  has  been  used  by  Prescott  [40]  and  others  [34], 

i  [41],  [42],  This  distribution  arises  from  a  mixture  of  Pois¬ 

son  distributions  whose  means  are  smeared  in  accordance 
!  with  the  gamma  distribution  [43]  Physically,  the  smear- 

I  ing  is  thought  to  arise  from  the  variability  of  the  second- 

)  ary-emission  efficiency  across  the  surface  of  the  dynode 

i 


I 


[34],  [40].  In  this  case, 

Var  (6)  =  <6>  +  <6>J/D  (43) 

where  D  is  the  “degrees-of-freedom”  parameter  describ¬ 
ing  the  extent  of  the  smearing  [44].  Two  special  limits  of 
the  negative  binomial  are  the  Poisson  distribution  for 
which  D  =  oo  (the  least  noisy)  and  the  Bose -Einstein  (or 
Furry)  distribution  for  which  D  =  1  (the  most  noisy). 

Of  the  various  possibilities  implicit  in  the  results  of  this 
section,  the  overall  lowest  excess  noise  factor  obtains  for 
a  high-gain  first-dynode  PMT  with  Poisson  multiplica¬ 
tion.  In  that  case,  using  (4 1 )— (43) ,  we  obtain 


Equation  (44)  is  plotted  as  the  dashed  curves  in  Fig.  3  for 
A  =  10  (m  =  1,4,  10).  The  gain  of  the  m  =  1  (4)  GaP 
PMT  is  approximately  the  same  as  that  of  the  m  =  5  (10) 
SAPD  It  is  apparent  that  the  excess  noise  factor  of  the 
PMT  can  fall  below  that  of  the  SAPD.  Of  course,  the 
overall  gain  of  a  PMT  can  stretch  to  *  10\  which  is  far 
and  away  greater  than  that  achievable  by  any  APD. 

The  excess  noise  factor  for  Poisson  secondary-emission 
multiplication  without  the  benefit  of  the  high-gain  first  dy¬ 
node  (all  dynodes  identical)  is  obtained  by  setting  A  =  1 
in  (44).  When  <A/>  »  1.  F,  then  takes  the  well-known 
approximate  form 

F,  =  ( Af  > 1  m((( Af > 1  m  -  1)  =  (6>/(<6>  -  1),  (45) 

signifying  essentially  noise-free  multiplication  even  in  this 
case 

The  excess  noise  factor  for  a  high-gain  first-dynode 
PMT  with  Bose-Einstein  secondary -electron  statistics  is 
obtained  from  (4 1  )-(43 >  with  D  =  1.  It  is 


<M)!A  -  1 


«A/>M),,m  -  1 


(44) 


1  +  ((M)M)lm  [  {M)!A  -  1 

(Af  )  L(<Af  )!A)'  m  -  1 


(46) 


which  is  plotted  as  the  long-dash  curve  in  Fig.  4  for  A  = 
10  (m  =  10). 

Finally  ,  the  noisiest  of  the  PMT  cases  considered  here 
arises  for  Bose-Einstein  secondary-electron  statistics  w  ith 
all  dynodes  equivalent.  This  result  is  obtained  by  setting 
A  =  1  in  (46).  For  <  M  >  »  1 .  Ft  then  takes  the  approx¬ 
imate  form 


Ft  *  2  <  Af  >  |,m/(<  Af)l/m  -  1) 

=  2<6>/«6>  -  1).  (47) 

This  is  only  a  factor  of  2  greater  than  the  excess  noise 
factor  in  (45)  for  Poisson  multiplication.  This  case  is  plot¬ 
ted  as  the  short-dash  curves  in  Fig.  4 (m  =  4,  10). 

It  is  apparent  that,  even  at  its  noisiest,  the  PMT  is  a 
relatively  quiet  device.  This  is  a  consequence  of  its  sin¬ 
gle-earner  vacuum  character.  The  low  excess  noise  factor 
has  made  the  PMT  an  indispensible  tool  in  optics  labo¬ 
ratories  since  the  1930's  and  it  is  not  likely  to  be  relegated 
to  the  junk  heap  any  time  soon. 
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VII.  Conclusion 

We  have  obtained  a  general  expression  (10b)  for  the 
output-current  variance  of  a  photodetector  in  terms  of  the 
Fano  factor  of  the  photoelectrons  and  the  excess  noise 
factor  of  the  multiplication  process  intrinsic  to  the  detec¬ 
tor  Ft.  Sources  of  noise  such  as  dark  current,  Johnson 
noise,  and  1//  noise  were  not  included  in  the  simple  for¬ 
mulation  considered  here;  they  can  be  easily  incorpo¬ 
rated,  however.  The  analysis  assumed  instantaneous  mul¬ 
tiplication  in  that  the  measurement  time  was  assumed  to 
be  greater  than  the  time  response  of  the  signal  [32]. 

The  variance  can  be  cast  in  terms  of  a  current  signal - 
to-noise  ratio  at  the  output  of  the  photodetector  given  by 

SNR  (/„)  =  <  /„>/[Var  (/„)]!* 

=  [(Ia)l2qB(5a  +  F,  -  1 )] 1 "  (48) 

where  <  la)  =  tj(  /phmon>.  In  the  usual  situation  of  Poisson 
photon  arrivals  (T„  =  1),  the  output-current  variance  is 
proportional  to  Fe  in  accordance  with 

Var  (/„)  =  2qUa)  B(M)lF,  (49) 

whereupon  the  experimental  excess  noise  factor  is 

<t>,  =  (M)2F,  (50) 

and  the  SNR  is 

SNR  (/„)  =  [<  la>/2qBFr)' :.  (51) 

In  this  case,  optimization  of  the  SNR  simply  involves 
minimizing  F,,  independent  of  <M>  In  the  presence  of 
thermal  noise  and'or  dark  noise,  however,  optimization 
of  the  SNR  is  achieved  at  specific  values  of  <M>  and  F, . 

Under  conditions  of  Poisson  photon  excitation  (or  in¬ 
deed  for  any  distribution  of  photons  for  which  is  con¬ 
stant  and  independent  of  (a>).  the  right-hand  side  of  (3) 
depends  only  on  the  random  gain  M.  Thus,  the  ratio  of 
overall  count  variance  to  overall  count  mean  is  constant 
and  independent  of  the  excitation  level.  The  output  cur¬ 
rent  from  any  multiplying  detector  illuminated  by  such 
light  therefore  has  shot-noise-like  behavior.  A  useful  con¬ 
sequence  of  this  property  is  the  applicability  of  the  square- 
root  normalizing  transformation  [45].  With  the  help  of  this 
computational  tool,  system  performance  can  be  readily 
evaluated  in  approximate  form  [46].  The  presence  of  ad¬ 
ditive  thermal  noise  and/or  dark  noise  will,  however,  de¬ 
stroy  this  shot-noise-like  behavior. 

In  the  complete  absence  of  photoelectron  fluctuations 
(5a  =  0),  and  thermal  and  dark  noise,  the  output-current 
variance  is  no  longer  proportional  to  Ft,  but  is  instead 
proportional  to  (F,  -  1 ).  Th  s,  Var  (/„)  -»  0  as  Fe  -»  1 
In  this  idealized  case,  the  signal-to-noise  ratio  at  the  out¬ 
put  of  the  photodetector  is 

SNR  (/„)  =  [< Ia)l2qB(Fr  -  l)]'7.  (52) 

which  can  become  large  for  a  photodetector  with  F,  -*  1 . 

Expressions  for  the  excess  noise  factors  for  CAPD, 
SAPD,  and  PMT  photodetectors  have  been  set  forth  and 


graphically  displayed.  In  terms  of  existing  devices,  small 
Si  CAPD’s  with  high  quantum  efficiency  and  near-ideal 
performance  (F,  =  2.6  corresponding  to  kr  =  0.006  at 
<AJ>  =  100)  have  been  fabricated  in  the  wavelength  re¬ 
gion  0.4  <  X  <  0.95  /im  [47].  Devices  that  are  even 
more  quiet,  with  Fr  <  2.2  corresponding  to  kc  <  0.002 
at  <  M  >  =  100,  appear  to  be  possible  [48] .  CAPD  devices 
with  essentially  SCISCM  properties  are  therefore  cur¬ 
rently  available  at  this  wavelength.  However,  quaternary 
devices  are  generally  used  in  the  wavelength  region  X  * 
1.5  nm.  Unfortunately,  these  have  kc  *  1  so  that  Fr  is 
much  higher  [20],  Dark  current  and  leakage  current  may 
also  present  difficulties  in  such  devices. 

SAPD’s  offer  promise  in  this  longer  wavelength  region. 
These  devices  have  potential  as  small,  high  quantum  ef¬ 
ficiency,  low-voltage  photodetectors,  with  low  leakage 
current  [5],  [20],  Although  SAPD’s  can,  in  principle,  ex¬ 
hibit  minimal  noise  ( Fe  -*  1 ),  their  average  multiplication 
is  restricted  (<  M)  s  2™  for  SCISCM  operation).  Resid¬ 
ual  hole  ionization  is  also  a  potential  problem  and  it  is 
imperative  to  construct  devices  in  which  this  effect  is  min¬ 
imized  [33].  The  dark-current  behavior  of  various  SAPD's 
has  not  yet  been  established.  Attempts  are  currently  un¬ 
derway  to  construct  a  long-wavelength  staircase  SAPD 
[24], 

Photomultiplier  tubes  have  remarkably  low  excess 
noise,  along  with  the  desirable  properties  of  low  dark  cur¬ 
rent,  high  gain,  good  pulse  resolution,  and  ease  of  oper¬ 
ation  in  the  photon-counting  mode.  However,  as  is  well 
known,  they  suffer  from  limited  quantum  efficiency,  large 
size,  high-voltage  requirements,  luminescence  noise,  and 
afterpulsing  due  to  H*  ions  or  inverse  photoemission  [34] 

PMT's  are  sometimes  used  to  discriminate  between  sin¬ 
gle-  and  multiple-photoelectron  events.  This  capability 
follows  from  the  high  gain  as  well  as  the  narrowness  of 
the  multiplied  electron  distribution  [34]  and  is  associated 
with  a  minimal  value  of  the  gain  Fano  factor  Such 
discrimination  is  more  difficult  for  the  CAPD  which  suf¬ 
fers  from  a  (rather  broad)  shifted  Bose- Einstein  gain  dis¬ 
tribution.  It  may  also  be  difficult  for  the  SAPD  which  ex¬ 
hibits  multiple  peaks  in  the  gain  distribu.ion.  unless  P  is 
close  to  unity  [32],  A  related  matter  is  tL .  operation  of  a 
photodetector  as  a  high-speed  photon  counter.  This  gen¬ 
erally  requires  some  104  electrons/photon  to  overcome 
preamplifier  Johnson  noise  [48].  Although  this  is  readily 
achieved  with  a  PMT,  such  gains  are  not  easily  attained 
with  APD's.  In  particular,  reaching  this  gain  in  a  SCISCM 
SAPD  would  require  a  structure  of  some  15  stages  [48] 

As  a  final  note,  we  reiterate  that  the  excess  noise  factor 
is  an  inadequate  measure  for  describing  the  performance 
of  a  digital-signal  information-transmission  system.  Ap¬ 
propriate  performance  measures  for  such  systems  (e  g  . 
probability  of  error)  require  knowledge  of  the  electron¬ 
counting  distributions.  In  particular,  digital-system  per¬ 
formance  is  strongly  dependent  on  the  tails  of  these  dis¬ 
tributions.  Certain  of  the  photodetectors  discussed  here 
will  have  more  favorable  shapes  for  minimizing  error 
probabilities  than  will  others  This  will  be  elucidated  in  a 
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companion  study  of  error  probabilities  for  a  simple  optical 
receiver,  to  be  presented  elsewhere  (12). 
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Recombination  radiation  from  semiconductors  can  exhibit  a  squeezed  photon  number  by  use  of  a  specially  designed 
space-charge  limited  light-emitting  structure.  The  light  will  exhibit  a  small  Fano  factor  and  large  photon  flux. 
The  structure  will  be  small  in  size  and  have  the  capability  of  being  modulated  at  a  high  rate. 


We  recently  reported  the  generation  of  cw  photon-number - 
squeezed  (sub  Poisson)  light  from  a  Franck-Hertz  experi¬ 
ment  in  which  Hg  vapor  was  excited  by  inelastic  collisions 
with  a  space-charge-limited  (quiet)  electron  beam.1  The 
experiment  operates  by  transferring  the  anticlustering  prop¬ 
erties  of  the  electrons,  resulting  from  Coulomb  repulsion,  to 
the  photons.  The  direction  of  transfer  is  the  inverse  of  that 
encountered  in  the  usual  photodetection  process,  in  which 
the  statistical  character  of  the  photons  is  imparted  to  the 
photoelectrons.  Since  the  mechanism  involves  ordinary 
spontaneous  emission,  it  is  a  first-order  optical  process  and 
can  be  expected  to  produce  a  strong  effect  2 

It  has  been  shown  that  the  combination  of  sub-Poisson 
excitations,  each  associated  with  a  single-photon  emission, 
leads  to  sub-Poisson  photon  counts,  provided  that  certain 
conditions  on  the  characteristic  times  and  areas  of  the  detec¬ 
tion  process  are  obeyed  -  The  sub-Poisson  electron  excita¬ 
tions  are  characterized  by  a  time  constant  r,  that  represents 
the  time  over  which  excitation  events  are  anticorrelated 
(antibunched).  The  single-photon  emissions,  on  the  other 
hand,  are  characterized  by  a  photon  excitation/emission  life¬ 
time  tp.  The  detected  light  will  be  sub-Poisson,  provided 
that  T  »  t,,  tp  and  A  »  Ac.  where  T  is  the  detector  counting 
time.  A  is  the  detector  counting  area,  and  A,  is  the  coherence 
area. 

In  spite  of  these  conditions  being  satisfied  in  the  Franck- 
Hertz  experiment,  the  light  was  only  weakly  sub-Poisson, 
principally  because  of  optical  losses  in  the  experimental 
apparatus  '  A  useful  source  of  such  light  should  exhibit  a 
photon  Fano  factor  Fr(T i  that  is  substantially  less  than 
unity  and  should  produce  a  large  photon  flux  (corresponding 
to  a  large  average  photon  number  <  nCT)  ].  If  the  light  is  to 
be  used  in  an  application  such  as  light  wave  communica 
tions.  the  switching  time  (or  symbol  duration)  T  should  also 
be  able  to  be  made  small  so  that  the  device  can  he  modulated 
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at  a  high  rate.4  On  the  other  hand.  T  must  be  sufficiently 
large  in  comparison  with  the  characteristic  times  rf  and  tp  to 
ensure  that  the  Bub-Poisson  character  of  the  photons  is  cap¬ 
tured  in  the  counting  time.12  Ideally,  the  device  should  also 
be  small  in  sixe  and  produce  a  directed  output  so  that  the 
light  can  be  coupled  to  an  optical  fiber.  The  structure 
should  be  designed  in  such  a  way  that  light  loss  is  mini¬ 
mized.3 

A  number  of  suggestions  have  been  made  for  achieving 
sub-Poisson  excitations  and  single-photon  emissions. 
Probably  the  simplest  method  is  to  discharge  a  capacitor  C, 
containing  a  fixed  charge,  through  a  circuit  containing  a 
photon  emitter  such  as  a  light -emitting  diode  (LED).  The 
current  wavefotm  will  then  be  a  nonstationary  pulse  with 
time  constant  r  «  RC  (where  R  is  the  resistance  of  the 
circuit).  A  steady-state  version  of  this  experiment  would 
make  use  of  a  fixed  current  stabilized  by  an  external  feed¬ 
back  circuit  to  drive  the  LED.  This  can  be  achieved  by 
means  of  a  constant -voltage  source  in  series  with  an  external 
resistor,'  R.  or  in  series  with  some  other  optoelectronic  com¬ 
ponent  with  a  suitable  I-V  characteristic.  Other  configura¬ 
tions.  in  which  the  anticorrelation  property  of  the  electrons 
is  achieved  by  using  external  feedback,  have  been  suggest¬ 
ed. « 

However,  in  all  cases  using  external  feedback,  the  charac¬ 
teristic  anticorrelation  time  of  the  excitations  t,  is  deter¬ 
mined  by  the  feedback  time  constant  of  the  loop  t,.  A  lower 
limit  on  the  feedback  time  constant  is  imposed  by  the  re¬ 
sponse  time  and  transit  time  of  carriers  through  the  device 
and  by  the  RC  characteristics  of  the  feedback  circuitry.  In 
general,  an  internal  feedback  proce:  s,  such  as  space  charge, 
will  provide  a  more  effective  means  of  providing  sub-Poisson 
excitations  than  will  external  feedback  This  is  because  an 
internal  physical  process  is  likely  to  result  in  a  smaller  value 
of  t r  than  will  external  electronic  circuitry  Configurations 
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making  use  of  space -charge -limited  excitations  will  there¬ 
fore  have  the  capacity  of  being  switched  faster7  than  those 
making  use  of  external  feedback. 

Using  another  approach  altogether,  it  has  been  suggested 
that  sub-Poisson  light  can  be  generated  by  the  use  of  nonlin¬ 
ear-optics  schemes  that  rely  on  correlated  photon  pairs  and 
selective  deletion.8-9  An  experiment  using  parametric 
downconversion1011  and  dead-time  optical  gating  has  indeed 
been  successfully  used  to  generate  such  light.12  Unfortu¬ 
nately  this  new  source  of  sub-Poisson  light  suffers  from  a 
number  of  familiar  problems:  a  Fano  factor  very  close  to 
unity,  low  photon  flux,  and  slow  switching  speed. 

It  can  be  shown  that  the  sub-Poisson  electron-excitation 
methods  are,  in  general,  superior  to  the  nonlinear-optics 
methods.  The  advantage  stems  from  several  factors:  (1) 
Photons  naturally  gravitate  toward  Poisson  counting  statis¬ 
tics  and  shot-noise  fluctuations.2-3  It  is  difficult  for  the 
nonlinear-optics  methods  to  undo  this  natural  Poisson  pho¬ 
ton  noise.  Electrons,  on  the  other  hand,  are  often  governed 
by  quieter  thermal-noise  fluctuations/-13  thereby  permit¬ 
ting  F„(T)  to  be  made  smaller.  (2)  Nonlinear-optics 
schemes  in  which  Poisson  photons  are  first  generated  (sub¬ 
ject  to  a  source  power  constraint)  and  subsequently  convert¬ 
ed  into  sub-Poisson  photons  cannot  provide  performance 
gain  in  applications  such  as  light-wave  communications.4 
(3)  Sub-Poisson  electron-excitation  configurations  produce 
light  by  means  of  efficient  single-photon  transitions;  large 
values  for  the  photon  flux  are  therefore  easily  achieved. 
Nonlinear-optics  methods,  on  the  other  hand,  rely  on  (rela¬ 
tively)  inefficient  multiple-photon  transitions.  Further¬ 
more,  they  are  subject  to  photon  interference  effects  that 
can  limit  the  degeneracy  parameter  (and  therefore  the  pho¬ 
ton  flux)  to  small  values.8  (4)  Electron  excitations,  especial¬ 
ly  those  mediated  by  a  physical  process  such  as  space  charge, 
can  attain  a  small  characteristic  response  time  r,  so  that  fast 
switching  can  be  achieved."  (However,  as  a  point  of  interest, 
it  should  be  noted  that  correlated  photon  pairs  and  postde- 
tection  processing,  such  as  subtraction  and  correlation,  may¬ 
be  useful  in  specialized  applications.14) 

We  conclude  that  an  ideal  source  of  squeezed -photon - 
number  light  will  operate  by  means  of  sub-Poisson  excita¬ 
tions,  mediated  by  an  internal  physical  feedback  process, 
and  will  incorporate  a  mechanism  for  achieving  highly  effi¬ 
cient  single-photon  emissions.  It  should  be  small  in  size  and 
preferably  be  able  to  produce  light  in  a  directed  beam. 

We  are  therefore  led  to  propose  a  semiconductor  device 
structure  in  which  sub-Poisson  electron  excitations  are  at¬ 
tained  through  space-charge-limited  current  flow  and  sin¬ 
gle-photon  emissions  are  achieved  by  means  of  recombina¬ 
tion  radiation.  A  device  of  this  nature  will  emit  photon- 
number-squeezed  recombination  radiation.  The  energy- 
band  diagram  for  such  a  space-charge-limited  light-emitting 
device  (SCLLED)  is  illustrated  in  Fig.  1.  Sub-Poisson  elec¬ 
trons  are  directly  converted  into  sub-Poisson  photons,  as  in 
the  space-charge-limited  Franck-Hertz  experiment,1  but 
these  are  now  recombination  photons  in  a  semiconductor. 
In  designing  such  a  device,  carrier  and  photon  confinement 
should  be  optimized  and  optical  losses  should  be  minimized. 

The  basic  structure  of  the  device  is  that  of  a  p*-i-n* 
diode  As  an  example,  the  p*  region  may  consist  of 
Al«,  ^Ga06<As.  several  thousand  angstroms  thick  and  doped 
to  p*  *  5  X  101"  cm'3,  and  of  a  layer  of  smaller-gap  material 


Fig.  1.  Energy- band  diagram  of  a  tpace-charge-limited  light-emit¬ 
ting  device  under  (a)  equilibrium  conditions  and  (b)  strong  forward- 
bias  conditions.  The  curvature  of  the  intrinsic  region  under  for¬ 
ward-bias  conditions  indicates  the  space-charge  potential.  Ef  is 
the  Fermi  level,  and  hu.-  is  the  photon  energy. 


for  confinement,  of  GaAs  with  the  same  thickness  and  dop¬ 
ing  level.  The  intrinsic  (i)  region  would  consist  of  a  2-10-rim 
layer  of  ultralow-doped  (nearly  intrinsic)  Alo^sGaossAs. 
The  n*  region  would  also  be  of  Alo.ssGao asAs.  The  structure 
should  be  grown  on  a  conductive  substrate.  A  0.15-eV  dis¬ 
continuity  in  the  valence  band,  between  the  GaAs  and  the 
Alo3sGao6sAs  layers  of  the  p+  region,  would  ensure  hole 
confinement  in  the  low-gap  material  at  room  temperature. 
The  conduction-band  discontinuity  should  be  about  0.25  eV. 
Near-infrared  recombination  radiation  would  be  emitted 
from  the  GaAs  layer  in  this  LED-like  region.  This  structure 
is  appropriate  for  a  waveguiding  edge-emitting  geometry. 
To  maximize  the  external  quantum  efficiency,  a  surface- 
emitting  geometry  may  be  preferable;  the  surface  should  be 
dome  shaped  to  eliminate  total  internal  reflection.  In  this 
case  the  p*  AlGaAs  layer  is  not  necessary. 

The  I-V  characteristic  for  a  single-carrier  conventional 
space-charge-limited  diode  is  given  by  the  approximate  rela¬ 
tionship15 

</,)  *  (9/8)((,Mn/L3)A<V,)2,  (1) 

where  (/,)  is  the  average  forward  current,  i,  is  the  permittiv¬ 
ity  of  the  medium,  ti„  is  the  electron  mobility,  ( V’,1  is  the 
applied  forward-bias  voltage,  and  A  is  the  device  area.  This 
expression  is  valid  under  strong  injection  conditions  (ap¬ 
plied  bias  voltage  >  built-in  potential),  provided  that  L  » 
Lp,  where  L  is  the  thickness  of  the  intrinsic  region  and  Lp  is 
the  electron  diffusion  length.  Typical  parameter  values  for 
the  structure  shown  in  Fig.  1  are  t,  *  1.15  X  10~12  F/cm,  wn  * 
1000  cm2/V  sec,  L  *  2.5  ? m,  A  *  10'4  cm2,  ( V, )  »  2  V,  and 
(!.,)  *  33  mA. 

The  current  noise  in  such  devices  can  be  quite  low.  It  has 
a  thermal  (rather  than  shot-noise)  character.13 16  The  cur- 
rent-noise  spectral  density  S,(u)  for  a  device  in  which  only- 
electrons  participate  in  the  conduction  process  takes  the 
simple  form16 

5r(a.)-8Wd(/,)/d<V,).  (2) 

where  k  ii,  Boltzmann’s  constant,  fi  is  the  device  temperature 
in  degrees  Kelvin.  d\/r)/d<V',>  is  the  differential  conduc¬ 
tance  of  the  device,  and  w  is  the  circular  frequency.  Com¬ 
binin',  expression  (1)  and  its  derivative  with  Eq.  (2)  provides 
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S,W)/2e<I')  ’Ske/ed’'),  (3) 

where  e  is  the  electronic  charge.  By  definition,  the  electron 
current  is  sub-shot  noise,  provided  that 

-  8M/e<Y  ,i  <  1.  (4) 

The  statistical  properties  of  light  generated  by  sub-Pois¬ 
son  electron  excitations  and  single-photon  emissions  are 
well  understood.-’  Light  is  considered  to  be  sub-Poisson 
when  the  photon  Fano  (actor  F„(7 ")  is  less  than  unity,1'  i.e., 
when 

F„(T)  =  var|n(711/intD'  <  1.  (5) 

The  quantities  var[n(n]  and  \n(T))  are  the  photon-number 
variance  and  photon-number  mean  in  the  counting  time 
interval  [0.  T\.  respectively.  A  source  of  light  may  be  sub- 
Poisson  for  one  value  of  T  and  super-Poisson  for  another 
value  of  T.  In  the  limit  of  a  long-counting-time/large-area 
detector,  the  photons  behave  as  classical  particles,  and  the 
expected  photoelectron  (postdetection)  Fano  factor  F„(T> 
is  given  by1-3 


[Fm<n-l)  =  i,d|Fr<ri-l], 


T  »  rt.  rp;  A  »  Ar  (6) 


Here  FC(T)  is  the  Fano  factor  of  the  exciting  electron  stream 
and  v  is  the  overall  quantum  efficiency  from  electrons  to 
detected  photons.  The  quantity  d  accounts  for  the  admix¬ 
ture  of  independent  Poisson  background  light  (0  <  d  <  1;  d  «* 
1  in  the  absence  of  such  background  light).1 

The  analog  versions  of  Eqs.  (51  and  (6)  are  of  interest  when 
the  detected  photocurrent  or  the  excitation  current  is  con¬ 
tinuous  (as  expected  for  the  SCLLED)  rather  than  when 
they  are  a  sequence  of  discrete  events.  The  formula  analo¬ 
gous  to  Eq.  (6l  relates  the  power  spectral  densities  of  the 
excitation  current  S«,(u.l  and  the  detected  photocurrent 
S*,U  ).  The  ratios  S,(u.-)/2e • /,  •  may  be  regarded  as  Fano 
factors  F,(T,I.  where  ;  ■  e.  m  and  the  (/,>  are  the  mean 
values  of  the  respective  currents.  Here  the  counting  times 
T.  play  the  role  of  inverse  bandwidths  of  the  filters  involved 
In  the  limits  T,  »  t„  rp  and  for  a-  «  1/r,.  l/rp,  we  obtain 

-  1]  =  >jd[St(u-)/2e‘/,'  -  1], 

w  «  1/t,,  1  /t p:  A  »  Ac.  (7) 

It  follows  from  Eqs  (4)  and  (7)  that  a  sub-shot-noise 
electron-excitation  current  will  always  lead  to  a  sub-shot- 
noise  detected  photon  current.  In  accordance  with  Eq.  (4). 
this  implies  sub-Poisson  photon  behavior  when  Sk8/e\ V,)  < 
1.  For  a  space-charge-limited  diode  such  as  that  shown  in 
Fig.  1,  it  is  estimated  that  8 kS/c  l’r'  *  0.1  when  8  *  300  K 
and  IV  =  2  V.  Further  reduction  of  this  ratio  can  be 
achieved  by  reducing  the  temperature  of  the  device.  It  is 
interesting  to  note  that,  in  the  regime  in  which  formulas  (1) 
and  (2l  are  valid,  the  degree  of  sub-shot -noise  behavior  is 
independent  of  the  parameters  t,.  iin.  L,  and  A.  The  values 
of  these  parameters  do  play  a  role  in  determining  the  photon 
flux  and  the  device  speed,  however  More  general  results 
are  available  for  the  double-carrier  space-charge-limited  de 
vice.1 

Using  Eqs.  ( 6 1  and  (Ti,  the  degree  of  photon-number 
squeezing  in  the  detected  photons  is  then 


Fm(T)  =  1  +  v(8k6/e(V,)  -  1).  (8) 

provided  that  background  light  is  absent  (d  =  1 ).  The  over¬ 
all  quantum  efficiency  t)  is  given  by  the  product  VtuVd-  where 
T),lt  is  the  external  quantum  efficiency  of  the  SCLLED  (the 
product  of  the  internal  quantum  efficiency  and  the  geomet¬ 
rical  collection  efficiency)  and  jj<j  is  the  external  quantum 
efficiency  of  the  photodetector.  Although  the  internal 
quantum  efficiency  of  a  LED  is  £0.5, 18  the  external  quan¬ 
tum  efficiency  of  a  dome-shaped  surface  emitter  is  typically 
limited  to  about  0.1 5.1920  If  a  Si  p-i-n  photodetector  is  used 
to  detect  the  photon-number-squeezed  recombination  radi¬ 
ation,  we  can  take  r id  *  0.75,  whereupon  » j  *  0.1125.  Finally, 
assuming  that  8 k6/e(Ve)  *  0.1,  as  determined  above,  we 
arrive  at  an  overall  estimated  postdetection  Fano  factor 
Fm(7T)  *  0.899.  A  commercially  available  LED,  with  ji„t  * 
0.03,  will  yield  Fm(T)  *  0.973.  In  both  cases,  T  can  be  as 
short  as  *  1  nsec. 

The  SCLLED  therefore  promises  to  provide  a  source  of 
photon-number-squeezed  light  with  properties  that  are  sig¬ 
nificantly  superior  to  those  of  the  Hg-vapor  space-charge- 
limited  Franck-Hertz  source'  IF^OT)  *  0.998  with  T  «  1 
Msec]  or  the  dead -time-gated  parametric  downconversion 
source12  \Fm(T)  *  0.9998,  with  T  =  19  Msec],  Indeed,  the 
degree  of  photon-number  squeezing  of  recombination  radia¬ 
tion  from  the  SCLLED  is  essentially  limited  only  by  the 
geometrical  collection  efficiency. 

The  waveguide  geometry  and  superfluoresence  properties 
(single-pass  stimulated  emission)  of  the  edge-emitting  LED 
could  be  advantageous  for  providing  improved  beam  direc¬ 
tionality,  switching  speed,  spectral  properties,  and  coupling 
to  an  optical  fiber. 

The  question  arises  about  whether  there  might  be  a  fur¬ 
ther  advantage  in  combining  space-charge-limited  current 
injection  with  a  semiconductor-laser  structure  rather  than 
with  the  LED  structure  considered  above  This  could  pro¬ 
vide  increased  emission  efficiency  and  additional  improve¬ 
ment  in  the  parameters  discussed  above.  This  will  be  bene¬ 
ficial  when  the  laser  can  be  drawn  into  a  realm  of  operation 
in  which  it  produces  a  state  more  akin  to  a  number  state21 
than  a  coherent  state  (the  coherent  state  has  Poisson  pho¬ 
ton-number  fluctuations  and  minimai  phase  fluctuations). 
Machita  et  al A22  have  shown  that  this  mode  of  operation  can 
be  attained  in  a  semiconductor-laser  oscillator,  within  the 
cavity  bandwidth  and  at  high -photon -flux  levels,  if  the 
pump  fluctuations  are  suppressed  below  the  shot-noise  lev¬ 
el.  Similar  suggestions  have  been  made  by  Smirnov  and 
Troshin23  and  by  Carroll.14 

In  summary,  we  have  shotvn  that  the  generation  of  sub- 
Poisson  light  is  most  readily  achieved  by  the  use  of  sub- 
Poisson  electron  excitations,  mediated  by  a  physical  mecha¬ 
nism  such  as  space  charge,  and  single-photon  emissions. 
We  have  proposed  a  space-charge-limited  light-emitting 
structure  as  a  fast  and  compact  solid-state  device  that  oper¬ 
ates  in  this  manner. 
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The  usual  open-loop  quantum  and  semiclassical  theories  of  light  detection  are  extended  to  include  closed-loop 
operation  in  which  ’.here  is  feedback  from  the  detector  to  the  source.  It  is  shown  that  the  unmistakable  signatures 
ot  nnni  lasMi.i.  light  associated  with  open-loop  detection,  such  as  sub-shot-noise  spectra  and  suo-Poisson  photo- 
oi-int-.  ih-n. : ,  am  over  to  closed -loop  systems  This  behavior  is  illustrated  through  quantitatively  indistinguish¬ 
able  quantum  jnd  semiclassical  analyses  of  two  recent  closed-loop  experiments  in  which  sub-Poisson  photocount 
statistic-  "ere  produced  It  turns  out  that  if  the  open-loop  illumination  does  not  require  the  use  of  quantum 

pi) . deli.  them ,  then  neither  d,-<--  the  closed -loop  illumination.  Conversely,  if  the  open- loop  illumination  is 

nonc.j--ic.i;.  men  the  closed-loop  behavior  must  be  analyzed  quantum  mechanicallv  The  use  of  nonclassical  field 
"■rrei.ui..i.-  •  oiit.un  light  beams  -hat  give  sub -Poisson  open-loop  photocounts  from  these  closed-loop  arrange- 
met:;-  ,-  m-t  eo  anil  generalized  into  a  synthesis  procedure  for  producing  light  beams  with  arbitrary  open-loop 

prion. i  nun:  -tati-iu.- 

1.  INTRODUCTION  The  clarity  ol  understanding  associated  with  open-loop 

photodetection  does  not  extend  to  closed-loop  configura- 
The  usual  rormuiatiotts  of  the  quantum'  and  semiclassi-  tions  in  which  there  is  a  feedback  path  leading  from  the 

cal  '  the.-ries  m  photocetectiun  presume  open-loop  config-  output  of  the  detector  back  to  the  light  beam  at  the  detector 

urations.  :  c  .  that  there  are  no  feedback  paths  leading  from  input.  In  this  paper,  we  develop  the  quantum  and  semicias- 

tht  output  ..I  ihe  photodetector  to  the  light  beam  impinging  sical  theories  of  light  detection  for  closed-loop  configura- 

on  that  detector,  in  -uch  configurations,  the  qualitative  tions1"  The  fundamental  quantity  of  interest  in  these  then- 

and  quantitative  distinctions  between  the  quantum  and  ries  is  the  random  point  process  formed  by  the  photodetec- 

semiclassu al  theories  are  well  understood.  In  the  quantum  tion  event  times.  Thus  we  begin,  in  Section  2.  with  a  high- 

theorv.  photocurrent  and  photocount  randomness  arise  level  review  of  such  processes,  focusing  on  their  application 

lrom  ihe  quantum  noise  in  the  illumination  beam,  whereas  to  open-loop  photodetection.  In  particular,  we  introduce 

in  the  .-emiciassicai  theorv  the  tundamental  source  of  ran-  the  doubly  stochastic  Poisson  process  (DSPP)1" -11'  of  semi- 

domne--  .-  associated  with  the  excitations  of  the  atoms  classical  open-loop  theory and  the  more  general  self-excit- 

torming  -ne  detector.  Nevertheless,  the  quantum  theory  ing  point  process  (SEPP' rl'  of  quantum  open-loop  theory 

-ubsume-  T.t-  -emicias-ical  theorv  in  a  natural  wav  in  that  This  material  both  establishes  the  analytical  framework  that 

tht-ir  i>pv!'.-:,M,p  predictions  coincide  exactly  when  the  quan-  we  need  for  the  closed-loop  treatment  and  summarizes  the 

Him  livid  II  imin.n m.g  •  hi-  photodetector  in  the  former  ap-  -emu  las-icai  limits  and  nonclassical  signatures  ot  open  -  loop 

proai  h  .-  .c.  ,i  v  i.is-K  ai -tate.  i.e  a  Glauber  coherent  state  or  operation 

a  .  ij'-i.  id  Mi'.dom  m.v  :re  ot  -m  h  -tate-  Inasmuch  a-  ( >ur  anaiv  -i-  •■!  dosed  -  loop  photodetection  begin-  in  Set  ■ 

it  i-  "it:-  t  :'.i iv . '  ,uid  -.vit!-.  -.une  difficulty,  that  light  1 1. .it  Here  we  use  the  incremental  point  prnce-s  de-crip 
1  ir-.i  ri;  -  .  •  -'.if.  g.-ni-r.t;i-.t  wn.-,-  quantum  statistics  tail  t«.r  tr.-m  Section  2  to  show  mat  both  the  semu  ius-n.ii  and 

.ot -mi •  ■  ::i  ;.i— n.ii  :t  i-  not  -urpn-ing  that  the  -emi  quantum  theorie-  lead  to  SEPP  -  i,.r  the  dosed  loop  .  a-e 

■  -  :t, i- .  n-i!  ■  o,.  -in  ma:n-tav  o|  photo  A-  a  re-ui:  t he  inini-takahle  open -loop  signature-  ot  non 

■ !'  ' 1 1 ' .  '  v  ■  i.  ar  .  u  *"  -  n  ,i-  -oit  -hot  n.,i-,  -pret r.i  .in::  -,P'  i 1  -i- - 
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soil  photocounts.  do  not  carry  over  to  closed-loop  systems. 
We  illustrate  this  behavior  by  providing  quantitatively  in¬ 
distinguishable  quantum  and  semiclassical  explanations  for 
two  recent  closed-loop  experiments  in  which  sub-Poisson 
photocounts  were  produced. '“  :l  There  is,  nevertheless,  a 
vital  distinction  between  the  two  theories  as  applied  to  these 
experiments.  According  to  the  quantum  theory,  nonclassi- 
cal  open-loop  beams  can  be  extracted  from  these  closed-loop 
arrangements  by  augmenting  them  to  exploit  the  nonclassi- 
cal  light-beam  correlations  associated  with  photon  twins2223 
or  Kt.-r-effect  quantum  nondemolition  (QND1  measure¬ 
ments.21 2J  Indeed,  the  former  approach  was  recently  used 
to  produce  a  nonclassical  open-loop  beam.25  W’e  address 
these  field-extraction  procedures  in  Section  4  and  show  that 
they  can  be  generalized  to  a  schema  for  synthesizing  light 
beams  of  arbitrary  photocount  statistics. 

2.  POINT  PROCESSES  AND  OPEN-LOOP 
PHOTODETECTION 

For  the  purposes  of  this  paper,  the  general  photodetection 
construct  ol  interest,  be  it  semiclassical  or  quantum,  open  or 
closed  loop,  takes  the  form  shown  in  Fig.  I.  A  quasi-mono- 
chromatic  paraxial  scalar  light  beam  of  nominal  frequency  i-, . 
illuminates  the  active  region  of  the  photodetector.26  The 
quantities  ol  interest  at  the  detector  output  are  the  photo¬ 
current  i.  and  the  photocount  record  ,V..  The  former  is  a 
train  ol  impulses  leach  ol  area  i ;.  with  q  the  electron  charge! 
located  at  the  photorieiectinn  event  times  It  The  latter 
counts  the  number  nl  -uch  events  that  have  occurred  in  the 
time  interval  I".  :  i  and  is  given  bv 
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The  event  time-  '  that  underlie  both  i  and  .V.  comprise  a 
random  point  process.  For  convenience,  they  will  be  ar¬ 
ranged  m  increasing  >rder  with  /  denoting  the  first  event  in 
the  interval  ".  *  j>  sketched  in  Fig 

A.  Pom'-Process  Statistics 

Without  i'-s-  .it  generatin',  we  shall  confine  ourselves  to 
point  pnm-s-e-  that  u-e  cnnriitioiiailv  orderly  :  in  essence, 
■his  mean-  'Pat  event-  *ucur  one  at  a  time  More  preciselv. 
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function!  is  the  conditional  probability  per  unit  time  t,  r 
there  to  be  an  event  at  t,  given  the  history  of  the  process  up 
to  f.2S  In  general.  m:  may  be  an  arbitrary  nonnegative  fun., 
tion  of  |tr.  A’,!,  in  which  case  Eqs.  (3i-i5i  constitute  the 
incremental  statistical  generator  of  the  general  SEPP  ' 
In  its  simplest  form,  u,  does  not  depend  on  the  event  history, 
and  Eqs.  (3)-(5)  then  describe  the  Poisson  process  with  rate 
function  a/-  Of  interest  in  the  sequel  is  the  DSPP.  w  nose 
incremental  statistics  are  given  by  Eqs.  ( 3 )— (5 )  with  ' 


M,  =  N,), 
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ut(y,  t. . .  y.)  =  lim  (At)* 

ai  —a 


Pr 
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(7) 


u-,ri •  r: . N1  =  PI 

fora  DSPP  with  random  rate  function  V  we  have  that 
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where  angle  brackets  denote  expectation  and  X,  is  a  nonneg 
ative  random  process  that  is  not  directly  influenced  bv  the 
point  process,  i.e.,  X,  is  conditionally  independent  of  It-.  .Yd 
given  knowledge  of  )Xr:  t  <  (|.  As  its  name  implies,  the 
DSPP  is  a  Poisson  process  whose  rate  function  is  the  random 
process  X,.  It  is  worth  noting  he  e  that  the  class  of  SEPP  s  is 
known  to  be  broader  than  that  of  DSPP's:  this  point  will  be 
illustrated  below  in  the  context  of  open-loop  photodetec- 
tion. 

A  useful  alternative  specification  of  point-process  statis¬ 
tics  can  be  made  through  multicoincidence  rates 
(MCR's).23 17  The  fcth-order  MCR  is 


for  k  =  1.  2 .  Basically,  tody,  y . y)Af'  is  the 

probability  that  events  are  registered  within  At  intervals 
about  the  distinct  times  y,  y . r,.  No  ordering  is  im¬ 

plied  among  the  nor  is  the  possibility  of  additional 
events  excluded.  For  a  Poisson  process  with  rate  function 
m-.  we  have  that 
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“'*(*1.  Tn. . Tk)  =  X, ^  ■  (9) 

There  is  a  more  genera!  result  connecting  MCR's  to  condi¬ 
tional  rate  functions  for  arbitrary  SEPP's,  but  it  is  not  suffi¬ 
ciently  explicit  to  warrant  inclusion  here.17 

In  terms  of  the  MCR's.  we  have  the  following  results  for 
the  photocount  and  photocurrent  statistics  associated  with 
the  point  process  (t,|.  The  count  probability  distribution  is 

”  ft  ft 
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stationary  random  process  with  mean  (P)  and  noise  spec¬ 
trum  Spp(f),  then  our  DSPP  formulas  yield 

(«)  *  9i)(P>/h»’0  (19) 

and 

SJf)  =  q  ( i )  +  (qri/hp0)^Spp(f)  (20) 

for  the  like  statistics  of  the  photocurrent.  Here  we  see  the 
well-known  shot-noise  lower  limit  of  semiclassicai  photode¬ 
tection,  i.e., 

S,,{/>  >  ?  <0.  (21) 

with  equality  at  ail  frequencies  only  if  P.  equals  (P  with 
probability  one. 

In  the  open-loop  quantum  theory,  the  event  times  under¬ 
lying  i,  and  N,  form  a  SEPP  whose  fcth-order  MCR  is 
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For  a  stationary  point  process  u  ;  is  a  constant  and  w»  de¬ 
pends  only  on  the  difference  between  its  arguments.  In  this 
case  the  mean  photocurrent  is 

(i)  -  qu  ,.  (13) 

and  its  noise  spectrum  (bilateral  covariance  spectrum)  is 

S  .(/")=  q  <i)  +  q:  j  dr[u  _,(T)  —  u  |"]exp(— i2rr/r).  (14) 

Equations  <  10)— ( 12 1  reduce  to  the  familiar  Poisson  limits 
when  the  MCR's  are  given  by  Eq.  t§).  Their  behavior  for 
DSPP's  and  SEPP's  forms  the  core  of  our  overview  of  open- 
loop  photodetection. 

B.  Open-Loop  Photodetection 

In  the  open-loop  semiclassicai  theory,  the  event  times  un¬ 
derlying  both  i:  and  .V-  form  a  DSPP  with  random  rate 
function  given  by 

\.  =  u  P.'hi...  (15) 

in  terms  of  the  detector  quantum  efficiency  i;.  the  (possibly 
randomt  power  illuminating  the  detector  P:,  and  the  photon 
energy  hi,,.  Equations  1 9 1  and  ilOt  then  yield  Mandel's 
rule1'  for  the  count  probability  distribution 


where  p  is  the  density  operator  for  the  field.  £(x.  f) 
exp(  —  1 2 7r )  is  the  positive-frequency  photon-units  field 
operator  for  x  =  (x,  y)  in  the  plane  of  the  photodetector.1’9 
and  .4,j  represents  the  active  area  of  the  photodetector. 
When  p  is  a  classical  state,  it  has  a  proper  P  representation, 
namely, 

p=  f  dJa  P(t>;  or*)lo)  (a!  (23) 


for  P  a  classical  probability  density  and  lot)  the  multimode 
coherent  state.  Equation  (22)  then  becomes 

ic,(t,.  r2 . rk)  =  |  d'aPta;  a* )  J  j  I  nP.  (24) 

where 

P,  =  hr,.(al  [  dxf’tx.  f)£'x.  Mia  ■  (25) 


Pri.V.  =  m  = 
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is  a  nonnegative  classical  stochastic  process  whose  statistics 
are  specified  by  P(a:  a*),  and  all  the  semiclassicai  formulas 
are  reproduced.  States  that  do  not  have  proper  P  represen¬ 
tations  are  called  nonclassical.  Their  photocount  and  pho¬ 
tocurrent  statistics  require  SEPP's  that,  in  general,  are  not 
DSPP's.  For  example,  if  p  is  a  photon  eigenstate  that  has 
exactly  .V  photons  within  the  space-time  region  ,40  *  [0.  Tl, 
then  Eqs.  ( 10)  and  (22)  yield  the  binomial  distribution-  ■'■JU 


Pri.V-  =  n)  =  ^  ^  J7n ( 1  —  it)'" 


W  3  !  dr*P.  he...  ( IT  • 

It  is  well  known  thai  this  distribution  does  not  permit  sub- 
Poisson  behavior.  viz  .  trom  Eq-  1 9 1  ■ !  1 1. 1 12).  ( 15).  1 17 ). 

van  A  i  =  V  -  car’ll"1  5  A  '.  il'1 

with  eqw.iiits  il  .mu  ,1111  i!  H  -  nonrandom.  It  P  is  a 


for  n  =  o.  1.  2. . V  This  distribution  is  sub-Poissonian 

for  all  "  <  rj  <  1  and  collapses  to  the  expected  nonrandom 
behavior  as  the  quantum  efficiency  goes  to  unite  1 

Sub-Poisson  photocounts  and  sub-shot-noise  photocur¬ 
rent  spectra  provide  unmistakable  signatures  tor  nonclassi¬ 
cal  light  in  open  loop  photodetection  Such  elfects  have 
now  been  ob-eneri  in  a  varietx  oi  experiments  Short  and 
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Mandel10  produced  conditionally  sub-Poisson  light  by 
means  of  resonance  fluorescence  from  sodium  vapor.  Their 
experiment  required  that  the  starting  time  of  the  photo¬ 
counting  interval  coincide  with  the  entry  of  a  single  atom 
into  the  field  of  view  of  the  apparatus  and  that  the  light  be 
generated  only  by  a  single  atom.  Subsequently,  Teich  and 
Saleh"  produced  unconditionally  (continuous-wave)  sub- 
Poisson  light  by  using  a  space-charge-limited  electron  beam 
to  excite  the  Franck-Hertz  effect  in  mercury  vapor.  More 
recently,  Slusher  et  al.,12  Shelby  et  al.,13  and  Maeda  et  al.li 
have  seen  sub-shot-noise  photocurrent  spectra  in  homodyne 
detection  of  light  that  has  undergone  four-wave  mixing,  as 
have  Wu  et  al.]*  in  a  three-wave  mixing  experiment.  It  is 
relevant  to  note  that  self-exciting  point  processes  play  a  role 
in  all  these  cases.  In  the  Short-Mandel  experiment,  the 
self-excitation  is  provided  by  the  dead  time  associated  with 
successive  atomic  emissions.10  In  the  Teich-Saleh  experi¬ 
ment,  it  is  provided  by  the  space  charge  associated  with  the 
electron  beam.  Indeed,  they  modeled32  the  statistics  of  this 
beam  as  a  renewal  point  process,  which  of  course,  is  a  special 
kind  of  SEPP.  Self-excitation,  in  the  multiwave  mixing 
experiments,  can  be  attributed  to  the  quantum  correlations 
between  the  pump  and  probe  beams  produced  by  photon 
exchanges  occurring  through  the  nonlinear  optical  process. 

3.  CLOSED-LOOP  PHOTODETECTION 

Most  theoretical  results  for  open-loop  systems  do  not  carry¬ 
over  to  closed-loop  photodelection.  We  proceed  now  to 
reexamine  the  semiclassical  and  quantum  photodetection 
properties  of  i.  and  -V,  when  the  output  of  the  photodetector 
is  permitted  to  affect  the  light  at  its  input  through  a  causal, 
but  possibly  nonlinear,  feedback  loop,  as  shown  in  Fig.  3.  In 
both  the  semiclassical  and  quantum  formulations  it  then 
turns  out  that  the  photodetection  event  times  form  a  SEPP 
Semiclassicallv.  this  is  most  easily  seen  through  the  incre¬ 
mental  point-process  description  [Eqs.  (3>— < 5 *] .  In  the  ab- 


ready  requires  SEPP  descriptions;  passing  to  the  closed  - 
loop  configuration  modifies  many  of  the  detailed  results  but 
does  not  break  out  of  the  general  SEPP  structure.  For 
example,  in  the  open-loop  configuration  it  is  known45  that 
the  photocurrent  t,  realizes  the  quantum  measurement i; 

l.  s  g  dx£'\x. ')£’< x.  n,  (29) 

U, 

with 

£'(x.  t)  *  i)I/2£(x.  t)  +  (l  -  ij)I/2£t (x,  r ),  (30i 

where  ^.(x,  r)  is  a  vacuum-state  field  operator  representing 
subunity  quantum  efficiency  noise.  In  Eqs.  (29)  and  1 30 1 , 
£(x,  t)  and  £t(x.  t)  are  commuting  free-field  operators  that 
do  not  explicitly  depend  on  |i.:r  <  r|.  When  the  loop  is 
closed,  Eqs.  (29)  and  (30)  continue  to  apply  but  with  £i  x.  f  i 
explicitly  dependent  on  past  measurements. 

Because  both  the  semiclassical  and  quantum  theories  lead 
to  SEPP  s,  there  are  no  unmistakable  signatures  for  nonclas- 
sica!  light  in  the  closed  loop.  To  be  specific,  for  a  SEPP  the 
quantity  unit,  s)  -  uqtOuqfs)  need  not  be  positive  semide- 
finite,  so  that  [see  Eqs.  (12)  and  (14)]  sub-Poisson  photo- 
counts  and  sub-shot-noise  photocurrent  spectra  are  possible 
in  both  theories.  In  what  follows,  we  shall  explore  this 
behavior  for  two  feedback  configurations,  which  we  term  the 
dead-time-modified  Poisson  process  (DTMPP)  and  the  neg¬ 
ative-linear-feedback  process  (NLFP). 

A.  Dead-Time-Modified  Poisson  Process 
It  has  long  been  known  that  semiclassical  photocounting 
statistics  are  impacted  by  dead  time  in  the  detection  system 
Consider  the  nonparalyzable  DTMPP.  which  is  a  renewal 
process  and  hence  a  SEPP.17  3*' 15  For  a  detector  with  fixed 
dead  time  r„,  illuminated,  in  the  semiclassical  theory,  by 
nonrandom  light  of  constant  power  P.  the  photocount  prob¬ 
ability  distribution  isr'-16 
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sence  of  feedback  we  have  a  DSPP.  i.e.,  the  semiclassical 
conditional  rate  obeys 

n.*{7iP..h i-nlt„ -V...  (27) 

where  P  .  the  illumination  power,  is  a  nonnegative  random 
process  that  is  not  directly  influenced  by  the  point  process, 
viz..  P  is  conditionallv  independent  of  the  photocount  histo¬ 
ry  It*.  .V.;  given  its  own  history  \P.  -  <  ti.  Closing  the  feed 
back  loop  ieaces  Eq.  <27i  tormally  intact  but  makes 

P .  -  P  i:t  .  .V.;i  1 2y • 

an  explicit  Junction  ol  the  photocount  historx  Thus  the 
DSPP  condit.on  is  violated,  indicating  that  closed-loop 
.-emu ijssu a.  pnoiodetection  leads,  in  general,  to  ,i  SEPP 
Quantum  mechanic  aid.  ..pen-loop  photodetection  ai 


where 


Pnt*.  a  i  3  a'e'Vfc!.  A  =  i|P/hi0.  (32) 

These  results  are  valid  for  a  detector  that  is  unblocked  at  the 
beginning  of  each  counting  interval,  although  exact  number 
distributions  for  counters  that  are  blocked  and  for  equilibri- 
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The  channel  capacity  C  of  a  light-wave  communication  system  based  on  photoevent  point-process  ob¬ 
servations  cannot  be  increased  by  use  of  photon-number-squeezed  light.  Under  a  constraint  of  max¬ 
imum  photon  rate  Xm„.  the  channel  capacity  C mkm,Je  is  achieved  with  a  Poisson  process  On  the  other 
hand,  the  channel  capacity  of  a  communication  system  based  on  photon  counting  can  be  increased  by  use 
of  photon-number-squeezed  light  The  improvement  vanishes  in  the  limit  of  very  small  mean  counts 

PACS  numbers  42  50  Ds 


All  light-wave  communication  systems  that  have  been 
developed  to  date  make  use  of  Poisson  (or  super-Poisson) 
light.1  For  Poisson  light,  the  variance  of  the  photon 
number  is  identically  equal  to  its  mean  for  all  values  of 
the  counting  time  T  Photon-number-squeezed  light,  on 
the  other  hand,  has  a  photon-number  variance  that  is  less 
than  its  mean  for  all  or  some  values  of  T  2  i  Such  light 
is  intrinsically  nonclassical  in  nature.  The  earliest  source 
of  unconditionally  photon-number-squeezed  (also  called 
sub-Poisson)  light  exhibited  only  a  slight  reduction  of  the 
variance.4  Stronger  photon-number  squeezing  has  been 
produced  in  more  recent  experiments5,6  and  continuing 
developments7'9  promise  further  improvement  in  the  fu¬ 
ture.  It  is  therefore  of  interest  to  investigate  whether 
there  might  be  an  advantage  to  using  photon-number- 
squeezed  light  in  a  direct-detection  light-wave  communi¬ 
cation  system 

There  are  two  classes  of  mechanisms  by  means  of 
which  unconditionally  photon-number-squeezed  light 
may  be  generated  In  the  first  class,  squeezed  photons 
are  produced  from  a  beam  of  initially  Poisson  (or 
super-Poisson)  photons  This  can  be  achieved  in  a  num¬ 
ber  of  ways,  e  g  ,  by  the  use  of  correlated  photon  beams 
or  quantum  nondemohtion  (QND)  measurements. 10  An 
experiment  of  this  kind  was  recently  carried  out  by  Rari¬ 
ty.  Tapster,  and  Jakemen  "  Sub-Poisson  photons  were 
generated  from  the  pair  of  correlated  photon  beams  pro¬ 
duced  in  parametric  down-conversion;  one  of  the  twin 
beams  was  used  to  gate  the  other  beam  selectively  via 
dead-time  control. 

The  second  class  of  mechanisms  relies  on  the  direct 
generation  of  squeezed  photons  from  a  beam  of  initially 
sub-Poisson  excitations  (eg.  electrons)  3  i:  This  tech¬ 
nique  was  first  used  by  Teich  and  Saleh  in  a  space- 
charge-limited  version  of  the  Franck-Hertz  experiment  4 


Perhaps  the  simplest  implementation  of  this  principle  is 
achieved  by  the  driving  of  a  light-emitting  diode  with  a 
sub-Poisson  electron  current.6  8 

In  this  Letter  we  show  that  the  channel  capacity  of  a 
light-wave  communication  system  based  on  the  observa¬ 
tion  of  the  photoevent  point  process  cannot  in  principle 
be  increased  by  the  use  of  photon-number-squeezed 
light.  On  the  other  hand,  the  channel  capacity  of  a 
photon-counting  system  can  be  increased  by  the  use  of 
photon-number-squeezed  light.  The  channel  capacity  is 
the  maximum  rate  of  information  that  can  be  transmu¬ 
ted  through  a  channel  without  error. 13  We  also  provide 
an  example  in  which  the  use  of  pholon-numbcr- 
squeezed  light  produced  from  Poisson  light  either  de¬ 
grades  or  enhances  the  error  performance  of  a  simple 
binary  on-oflf  keying  photon-counting  system,  depending 
on  where  the  average  power  constraint  is  placed 

Consider  the  transformation  of  a  Poisson  beam  of  pho¬ 
tons  (represented  by  a  Poisson  point  process10  N,  of  rate 
p, )  into  a  sub-Poisson  beam  of  photons  represented  by  a 
point  process  M,  of  rate  X,.  The  events  of  the  initial  pro¬ 
cess  A,  are  assumed  to  be  observable  (e.g.,  by  the  use  of 
correlated  photon  beams  or  a  QND  measurement)  and 
their  registrations  used  to  operate  a  mechanism  which,  in 
accordance  with  a  specified  rule,  leads  to  the  events  of 
the  transformed  photon  process  M,.  The  rate  X,  of  the 
process  M,  is  thereby  rendered  a  function  of  the  realiza¬ 
tions  of  the  initial  point  process  A',  at  prior  times,  i.e.. 
X,  -X,(A,.;  I'si). 

Several  examples  of  transformations  of  this  kind  that 
have  been  suggested  for  use  in  quantum  optics  are  illus¬ 
trated  in  Fig.  1  and  discussed  below  It  is  assumed  for 
simplicity  (but  without  loss  of  generality)  that  the  vari¬ 
ous  conversions  can  be  achieved  in  an  ideal  manner. 

(i)  Dead -time  deletion  Delete  all  photons  within  a 
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(a)  Poisson 


(b )  Dead  -  time  deletion 
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(c)  Coincidence  decimation  - •— - - •-> 


(d)  Decimation 


(e)  Overflow  count  deletion  •-• - ••  • - ) 

(f) Rate  compensation  !«••••• — •  ••  — • - •-• ^ 
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FIG  l  Several  transformations  of  Poisson  photons  into 
sub-Poisson  photons  that  have  been  suggested  for  use  in  quan¬ 
tum  optics 


prescribed  fixed  (nonparalyzable)  dead  time  r  following 
the  registration  of  a  photon. 14  Rarity,  Tapster,  and 
Jakeman"  generated  photon-number-squeezed  light  by 
using  one  of  the  twin  beams  produced  in  parametric 
down-conversion  to  gate  photons  selectively  from  the 
other  beam  via  dead-time  control.  Dead-uinc  deletion 
could  also  be  used  with  correlated  photon  beams  pro¬ 
duced  in  other  ways. 

(ii)  Coincidence  decimation  Remove  all  pairs  of  pho¬ 
tons  separated  by  a  time  shorter  than  a  prescribed  time 
interval  r'.  This  is  achieved,  for  example,  in  second- 
harmonic  generation;  two  photons  closer  than  the 
intermediate-state  lifetime  of  the  second-harmonic- 
generation  process  are  exchanged  for  a  third  photon 
(which  is  at  twice  the  frequency  and  therefore  easily 
eliminated). 15 

(iii)  Decimation:  Select  every  r th  photon  (/•“ 2, 
3, . .  .)  of  an  initially  Poisson  photon  process,  deleting  all 
intermediate  photons.  Saleh  and  Teich16  suggested  us¬ 
ing  correlated  photon  beams  to  implement  this  tech¬ 
nique  In  cascaded  atomic  emissions  from  40Ca,  for  ex¬ 
ample,  sequences  of  correlated  photon  pairs  (green  and 
violet)  are  emitted.  The  green  photons  can  "be  detected 
and  used  to  operate  a  gate  that  passes  every  rth  violet 
photon.  Decimation  control  could  also  be  used  in  con¬ 
junction  with  parametric-down-conversion  photon  twins. 

(iv)  Overflow  count  deletion  The  number  of  pho¬ 
tons  occurring  in  preselected  time  intervals  lO.Tol. 

[To^Tol . is  counted,  retaining  the  first  n o  photons 

in  each  time  interval  (without  changing  their  occurrence 
times)  and  deleting  the  remainder.  If  the  average  num¬ 
ber  of  photons  in  [0,7ol  of  the  initial  process  is  »/to. 
then  the  transformed  process  will  almost  always  contain 
n o  photons  within  this  time  interval  As  an  example, 
Mandel17  suggested  that  if  a  collection  of  no  atoms  in 
the  ground  state  are  subjected  to  a  brief,  intense,  in¬ 
coherent  excitation  pulse,  all  n0  atoms  will  become  excit¬ 
ed  with  high  probability;  the  radiated  optical  field  would 


then  be  describable,  to  good  approximation,  by  an  no¬ 
photon  state.  Related  schemes  have  been  proposed  by 
Yuen18  and  by  Stoler  and  Yurke19  for  use  with  paramet¬ 
ric  processes. 

We  proceed  to  illustrate  that  none  of  these  mod¬ 
ifications  can  increase  the  channel  capacity  of  a  com¬ 
munication  system  based  on  photoevent  point-process  ob¬ 
servations. 

If  a  constraint  is  placed  on  the  rate  of  the  initial  Pois¬ 
son  process  p,  <  then  it  is  obvious  that  C  cannot  be 
increased  by  the  modification  A,—*  M,.  This  is  simply  a 
consequence  of  the  definition  of  channel  capacity  :  It  is 
the  rate  of  information  carried  by  the  system  without 
error,  maximized  over  all  coding,  modulation,  and 
modification  schemes.  Can  the  modification  A’,  — •  M, 
increase  the  channel  capacity  if  the  constraint  is  instead 
placed  on  the  rate  of  the  modified  process  A,  (i.e  . 
A;  —  Am*x  )  9 

We  address  this  question  for  an  arbitrary  self-exciting 
point  process  M,  of  rate  A, (A/,.;  r'<  t).  This  is  a  pro¬ 
cess  that  contains  an  inherent  feedback  mechanism  in 
which  present  event  occurrences  are  affected  by  the  pre¬ 
vious  event  occurrences  of  the  same  point  process.  Of 
course,  the  modified  Poisson  processes  A’,  —  M ,  intro¬ 
duced  above  are  special  cases  of  self-exciting  point  pro¬ 
cesses. 

An  example  of  a  system  that  generates  a  self-exciting 
point  process  is  that  of  rate  compensation  (by  linear 
feedback)  of  a  source  which,  without  feedback,  would 
produce  a  Poisson  process.  Let  each  photon  registration 
at  time  t,  cause  the  rate  of  the  process  to  be  modulated 
by  a  factor  h(t—t ,)  (which  vanishes  for  t<t,).  In 
linear  negative  feedback  the  rate  is  A,  “Ao~  £,  hU  —  I, ). 
where  Ao  is  a  constant.  If  the  instantaneous  photon 
registration  rate  happens  to  be  above  the  average  then  it 
is  reduced,  and  vice  versa.  This  process  is  schematically 
illustrated  in  Fig.  1(f)  for  two  adjacent  subintervals  T t 
and  T 2-  Yamamoto,  Imoto.  and  Machida70  suggested 
the  use  of  rate  compensation  in  conjunction  with  a  QND 
measurement  (using  the  optical  Kerr  effect),  but  it  could 
be  used  just  as  well,  for  example,  with  correlated  photon 
pairs.  Dead-time  deletion  can  be  viewed  as  a  special 
case  of  rate  compensation  in  which  the  occurrence  of  an 
event  zeros  the  rate  of  the  process  for  a  specified  time 
period  r  after  the  registration. 10 

Now  consider  a  communication  system  that  uses  a 
point  process  M,( X)  whose  rate  A,(,Y)  is  modulated  by  a 
signal  X,.  The  process  M,(X)  can  be  an  arbitrary  self¬ 
exciting  point  process  (e  g.,  it  can  be  photon-number 
squeezed)  which  includes  processes  obtained  by  the  feed¬ 
forward  or  feedback  modification  of  an  otherwise  Poisson 
process.  Neither  feedforward  nor  feedback  transforma¬ 
tions  can  increase  the  capacity  of  this  channel,  as  provid¬ 
ed  by  Kabanov's  theorem  :i 

Kabanov's  theorem — The  capacity  of  the  point- 
process  channel  cannot  be  increased  by  feedback  Under 
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the  constraint  X,  <  Xma!l,  the  channel  capacity  C  is 

C  (1) 

When  the  capacity  is  achieved,  the  output  of  the  point- 
process  channel  is  a  Poisson  process  with  rate  X,  “Xma„/e 
(the  base  e  has  been  used  for  simplicity).  The  channel 
capacity  has  also  been  determined  under  added  con¬ 
straints  on  the  minimum  rate  (dark  events)  and  on  the 
mean  rate.21  A  coding  theorem  has  also  been  proved. 

In  summary,  no  increase  in  the  channel  capacity  of  a 
point-process  light-wave  communication  system  may  be 
achieved  by  the  use  of  photons  that  are  first  generated 
with  Poisson  statistics  and  subsequently  converted  into 
sub-Poisson  statistics  regardless  of  whether  the  power 
constraint  is  placed  at  the  Poisson  photon  source  or  at 
the  output  of  the  conversion  process.  Nor  may  an  in¬ 
crease  in  channel  capacity  be  achieved  by  the  use  of 
feedback  to  generate  a  self-exciting  point  process 
These  conclusions  are  valid  only  when  there  are  no  re¬ 
strictions  on  the  receiver  structure  The  conclusion  is 
different  if  the  receiver  is  operated  in  the  photon¬ 
counting  regime,  in  which  information  is  carried  by  a 
random  variable  n  representing  the  number  of  photo¬ 
events  registered  in  time  intervals  of  prescribed  duration 
T  (rather  than  by  the  photon  occurrence  times). 

The  capacits  of  the  photon-counting  channel  is  given 
by13 

C  mB\h  ln(  1  +  \/h )  +  Inf  I  +h )].  (2) 

where  h  is  the  mean  number  of  counts  in  T  and 
is  the  bandwidth  Two  limning  expressions  emerge 

jfln  ln(  \/h ).  h  «  I , 

^  j/?ln(« ).  h  -•>  I 

If  an  added  constraint  is  applied  to  the  photon  counts, 
such  that  they  must  obey  the  Poisson  counting  distribu¬ 
tion,  the  capacity  is  further  reduced  In  that  case,  the 
limiting  results  analogous  to  Eq  (3)  are 

ffln  In ( \/h ).  h  <'  i , 

^  |  }  Sln(n),  h  »  I.  ^ 

In  the  case  of  photon  counting,  therefore,  an  increase  in 
the  channel  capacity  can  in  principle  be  realized  by  the 
use  of  photon-number-squeezed  light.  However,  in  the 
small  mean-count  limit  h  «  I  (very  short  T),  the  capaci¬ 
ty  of  the  Poisson  counting  channel  approaches  that  of  the 
unrestricted  counting  channel,  and  the  advantage  of 
photon-number  squeezing  disappears  This  is  not  unex¬ 
pected  in  view  of  the  result  obtained  from  Kabanov's 
theorem  for  the  point-process  channel 

The  channel  capacity  provides  a  limit  on  the  max¬ 
imum  rate  of  error-free  information  transmission  for  all 
codes,  modulation  formats,  and  receiver  structures. 13  As 
such,  it  does  not  specify  the  performance  (error  probabil¬ 
ity)  achievable  by  a  communication  system  with  pre¬ 


scribed  coding,  modulation,  and  receiver  structure. 

It  is  therefore  of  interest  to  examine  the  performance 
of  a  system  with  specified  structure.  We  consider  a 
binary  on-off  keying  photon-counting  system.1  The  in¬ 
formation  is  transmitted  by  the  selection  of  one  of  two 
values  for  the  photon  rate  X,,  in  time  slots  of  (binary¬ 
digit)  duration  T.  The  receiver  operates  by  counting  the 
number  of  photons  received  during  the  time  interval  T 
and  then  deciding  which  rate  was  transmitted  in  accor¬ 
dance  with  a  likelihood-ratio  decision  rule  (threshold 
test).  For  simplicity,  it  is  assumed  that  background 
light,  dark  noise,  and  thermal  noise  are  absent  so  that 
photon  registrations  are  not  permitted  when  the  keying  is 
OFF  (i.e.,  false  alarms  are  not  possible).  Furthermore, 
the  detector  quantum  efficiency  is  taken  to  be  unity  so 
that  system  performance  is  limited  only  by  the  quantum 
fluctuations  of  the  light. 

A  measure  of  performance  for  a  digital  system  such  as 
this  is  the  error  probability  Pe.  In  the  simplified  system 
described  above,  errors  are  possibly  only  when  the  keying 
is  ON  and  0  photons  are  received  (a  miss).  For  a  Pois¬ 
son  transmitter.  P,  is1 

P,  (Poisson)  ”  |  exp(  -  h).  (5) 

where  h  denotes  the  mean  number  of  emitted  photons. 
To  minimize  Pr.  n  is  made  equal  to  its  maximum  allowed 
value  fim,.  This  result  is  now  compared  with  those  ob¬ 
tained  for  photon-number-squeezed  light  derived  from 
an  initially  Poisson  source.  The  outcome  will  depend  on 
where  the  mean  photon-number  constraint  is  placed. 
Two  transformations  are  explicitly  considered  dead¬ 
time  deletion  and  decimation 

(i)  Dead-time  deletion  For  a  nonparaly zablc  dead¬ 
time  modifier  that  is  always  blocked  for  a  dead-time 
period  r  at  the  beginning  of  the  counting  interval  T,  the 
passage  of  0  photons  arises  from  the  emission  of  0  pho¬ 
tons  in  the  time  T—  r,  independent  of  the  number  of 
emissions  during  r.  The  error  probability  for  this  system 
is  therefore 

P,(dead  time)  **  j  expl  —  n(l  —  x/T))  (6) 

To  minimize  error  under  the  constraint  h  s  hw,,.  we 
take  n“nm,»  The  error  is  obviously  larger  than  that 
for  the  Poisson  channel  [Eq  (5))  and  so  no  performance 
enhancement  can  be  achieved  by  use  of  this  modifier.  If. 
instead,  the  dead-time  modifier  is  always  unblocked  at 
the  beginning  of  each  bit  then  the  passage  of  0  photons 
arises  from  the  emission  of  0  photons  in  the  time  T,  and 
the  dead  time  has  no  effect  on  the  error  rate  in  this  sim¬ 
ple  system  Calculations  for  the  unblocked  counter  in 
the  presence  of  false  alarms,  however,  demonstrate  that 
the  presence  of  deal  time  always  does,  in  fact,  degrade 
system  performance  with  such  a  constraint.22  Although 
the  detailed  calculations  were  carried  out  for  electrical 
dead  time,  the  results  are  also  applicable  for  optical  dead 
time  when  the  photon  detection  efficiency  r?  “  I  On  the 
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other  hand,  if  the  constraint  is  placed  on  the  mean  pho¬ 
ton  count  m  after  dead-time  modification  (m  <  it 

can  be  shown  that  there  exists  a  value  of  rnm„  below 
which  performance  is  degraded  and  above  which  perfor¬ 
mance  is  improved,  relative  to  the  Poisson  channel. 

(ii)  Decimation :  We  assume  that  the  decimation  pa¬ 
rameter  r  —  2  (i.e.,  every  other  photon  of  a  Poisson  se¬ 
quence  of  events  is  selected)  and  that  the  decimation 
process  is  reset  at  the  beginning  of  each  bit  (i.e.,  the  first 
photon  in  each  bit  is  not  selected)  The  error  probability 
is  then 

P, (decimation )  "  j  ( I  +  n  )exp(  —  n ).  (7) 

which  again  represents  a  degradation  of  performance  in 
comparison  with  the  Poisson  channel  (under  a  constraint 
n  S  nm„)  In  this  case,  the  error  rate  is  increased  be- 
cauve  there  are  two  ways  for  the  passage  of  0  photons  to 
arise  in  the  time  T  from  the  emission  of  0  photons  or 
from  the  emission  of  I  photon  However,  if  the  con¬ 
straint  is  placed  on  the  modified  process,  then,  once 
again,  there  exists  a  value  of  mm„  below  which  perfor¬ 
mance  is  degraded  and  above  which  it  is  improved,  rela¬ 
tive  to  the  Poisson  channel 

It  is  evident  from  these  examples  that  system  per¬ 
formance  can  be  enhanced  by  the  use  of  photon- 
number-squeezed  light,  provided  that  the  average  power 
constraint  is  applied  to  the  squeezed  light.  No  enhance¬ 
ment  of  system  performance  emerges  in  converting  Pois¬ 
son  photons  into  squeezed  photons  when  the  average 
power  constraint  is  at  the  Poisson  source. 

Losses  have  been  ignored  in  the  considerations  above 
It  is  important  to  keep  in  mind  the  inevitable  random 
photon  deletion  that  results  from  absorption,  scattering, 
and  the  finite  quantum  efficiency  of  the  detector.  It  is 
well  known  that  these  deletions  will  transform  a  deter¬ 
ministic  photon  number  into  a  binomial  photon-number 
distribution,1123  which  always  remains  sub-Poisson  but 
approaches  the  Poisson  boundary  as  the  random  deletion 
increases.23  Mandel'1  has  shown  that  the  information 
rate  per  symbol  carried  by  such  a  counting  channel  will 
be  greater  than  that  for  the  Poisson  channel,  but  will  ap¬ 
proach  the  latter  as  the  photon-detection  probability  n 
approaches  0.  We  have  shown  elsewhere24  that  the 
performance  of  a  simple  binary  on-off- keying  photon¬ 
counting  system,  of  the  kind  considered  earlier,  is  also 
superior  for  a  binomial  source  than  for  a  Poisson  source, 
approaching  the  latter  as  rj  approaches  0. 
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Fljd  =  -~K,  (si 
ds : 


M  j(x  )  =  —  — -K,ls) 
ds ' 


This  simple  pattern  does  not  persist  beyond  third  order, 
however. 

Consider  a  primary  process  p  characterized  by  the 
MGF  Qrts),  whose  members  independently  initiate  a 
branching  process  resulting  in  an  overall  multiplication 
m  described  by  Q„<s  I.  The  moment-generating  function 
of  the  overall  compound  process  mixes  the  above  via 

Qc<s)  =  QA-lnQm(sti  ,  <B5) 


which,  using  Eq.  (B3),  can  be  rewritten  in  terms  of  the 
associated  CGF's  as 

K({s)  —  Kp[ — Afm(sl)  .  (B6i 

Differentiating  both  sides  according  to  Eq.  (B4),  and  ex¬ 
panding  via  the  chain  rule,  leads  to  the  general  relations 

(rtc  )  —  (p  )  ( m  )  ,  (B7a 1 

V(nr  )  =  Vtpiim  ):+  (p  )Vim  )  ,  (B7b i 

M }inc  )  =  M  t(p)(  m  >’  +  3  F(p)(  m  )  V Im  )  +  (p  )Ml{m )  , 


which  may  be  applied  to  any  simple  compound  process 
where  a  primary  counting  distribution  P\p)  drives  a 
multiplication  counting  distribution  Pint). 
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s  j 
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;  '  j 
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*.151=  £  KfS)  . 

!  I 
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Fie  4.  (al  Closed-loop  photocounting  experiment  carried  out  by 
Walker  and  -Jakeman.19  ibi  Photocounting  experiment  carried  out 
bv  Teich  and  Vannucci.'fi 


um  counters  are  also  available.  In  the  usual  situation,  the 
mean  count  is  much  greater  than  unity,  in  which  case  the 
differences  arising  from  the  three  initial  conditions  are  in¬ 
substantial  and  a  simple  approximation  for  the  photocount 
distribution  suffices  !P  The  photocount  mean  and  variance 
then  take  the  asymptotic  forms 

(,V./  =  ,\f/(l  +  \r.)  (33) 


vari.V.i  =  >  ,Y  >/U  +  ,\r 0i\  <34 * 

representing  sub-Poisson  behavior  for  all  values  of  \ra. 

The  DT.MPP  results  are  relevant  to  experiments  recently 
carried  out  by  Walker  and  Jakeman.''"  The  simplest  form  of 
their  experimental  arrangement  is  illustrated  in  Fig.  4(a). 
The  registration  of  a  photoevent  at  the  detector  operates  a 
trigger  circuit  that  causes  an  optica!  gate  to  be  closed  for  a 
fixed  period  of  time  after  the  time  of  registration.  During 
this  period,  the  power  P:  of  the  (He-Ne)  laser  illuminating 
the  detector  is  set  precisely  equal  to  zero  so  that  no  photode¬ 
tections  are  registered.  The  arrangement  is  therefore 
equivalent  to  the  one  illustrated  in  Fig.  4(b),  in  which  the 
gating  is  electronic  rather  than  optical,  at  least  so  far  as  the 
photocount  statistics  are  concerned.  This  latter  arrange¬ 
ment  was  used  hv  Teich  and  Vannucci.w  and  sub-Poisson 
photocounts  were  observed  in  both  cases.  This  is  because 
the  point  process  seen  bv  the  counter  in  these  experiments  is 
the  DTMPP  considered  above.  As  Walker  and  -Jakeman 
understood,  their  observations  can  he  explained  without  re- 
ioiirse  lo  'he  quantum  theory  of  photodetection;  under 
oio»ed-loop  .  onditions.  -ub-Poisson  photocounts  are  possi¬ 
ble  within  the  -emu  i.is-u  jl  tramework  Nevertheless,  it  is 
in-triu  me  develop  ’he  quantum  Inundations  for  Eq- 
■  anil  I  ,n  a  manner  >  muiui  ive  to  generalization. 

'  •  1 1- 1- ; •  r  ■ : - 1 '  1. 1 1 : '  or.  ■.  er -mil  n|  a  -omewh.it  rn  her  dead 


i  ,n  a  manner  i  omlucive  ' 
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time  experiment  shown  in  Fig  5.  Here,  a  coherent-state 
signal  field  £s(x,  0  with  mean 

<«l£s(x.r)l«>  -  (P/hv^)1*  (35) 

illuminates  an  in-loop  photodetector  of  quantum  efficiency 
n  through  a  feedback-controlled  flip  mirror.  For  sec  after 
each  in-loop  photocount  registration,  the  flip  mirror  directs 
£s(x,  t)  to  an  out-of-loop  photodetector  of  matched  quan¬ 
tum  efficiency  rp  During  this  dead-time  interval,  the  in¬ 
loop  detector  is  illuminated  by  a  vacuum-state  field  operator 
£i u(x.  t ).  After  this  dead-time  interval  the  mirror  returns  to 
its  previous  position,  in  which  £$  illuminates  the  in-loop 
detector  and  £M  illuminates  the  out-of-loop  detector. 

We  derive  the  steady-state  first  and  second  moments 
(means,  variances,  and  covariance)  of  the  in-loop  and  out-of- 
loop  photocount  records  N,  and  N,'  m  the  high-mean-count 
limit.  T oward  that  end,  we  recast  the  flip-mirror  subsystem 
into  the  explicit  lossless  modulated  beam-splitter  form, 
shown  in  Fig.  6,  in  which 

£1N(x.  t)  -  [T(t)]'/J£s(x,  f)  +  [1  -  T(t)]1/2£M(x.  t)  (36) 
and 


Fig  5  Quantum-photodetection  configuration  that  can  be  used  to 
represent  the  DTMPP  experiments  carried  out  by  Walker  and  Jake- 
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£0VT<x,t)  =  -[i  -  T(£))1,:£s(x.n  +  [rtf)]1  -£v(x.o.  (37) 
with  beam-splitter  transmission  given  by 

Tit)  =  1  —  <j_1  f  i,hd(t  —  r)d7  (38) 


in  terms  of  the  causal  linear  system  impulse  response 
for  0  <  f  <  rd 
otherwise 


hM) 


{o 


(39) 


and 


(A ',)  *  I?'1  <Ir>dr  =  TJ  <£‘in)2j4<j£ 

Jo 

*=  \t/(l  +  \Td)  (51) 


(JV,')  =  Q  1  f  (t/>dr  *  rj  (£0LT):Adt 
Jo 

-  At  -  Xt/(1  +  \rd),  (52) 


Because  i,  has  an  impulse  of  area  q  at  each  in-loop  photo¬ 
event  and  because  such  events  are  impossible  when  the  in- 
loop  detector  is  illuminated  by  the  vacuum-state  field  it 
is  easily  verified  that  Eqs.  (36)— (39)  correctly  describe  the 
flip-mirror  arrangement. 

To  proceed  further,  we  replace  the  classical  in-loop  and 
out-of-loop  photocurrents  i,  and  i/  by  their  associated  oper¬ 
ator  representations  [cf.  Eqs.  (29),  (30)]: 

i ,  *  q  I  dx£)N'*(x.  t)£1N'(x,  t)  (40) 

and 

i.  —  q  I  dx£0LT'f(x,  t)£0LT'(x,  f),  (41) 

1a. 

where 

£/(x.  tl  =  r,1  -E.lx.fl  +  (1  -  » ,)',3£;l<x.  t)  (42) 

for ;  =  IN.  OUT,  and  |£in,,  £oi  t.  I  are  vacuum-state  field 
operators.  We  next  employ  the  high-mean-count  condition 
to  justify  the  following  linearization  of  Eqs.  (361-138): 


where  X  =  rj/Vheo,  as  in  Eqs.  (31 )— (34).  Equation  (51)  repro¬ 
duces  Eq.  (33),  as  promised.  Equation  (52)  is,  in  fact,  self- 
evident  in  that,  by  construction,  JV,  +  JV,'  corresponds  to 
performing  quantum  efficiency  r)  photocounting  on  the  co¬ 
herent-state  field  £$■  Thus  .V,  +  JV,'  must  be  a  Poisson 
process  of  rate  X.  The  less  obvious  results  concern  the  sec¬ 
ond  moments,  with  which  we  deal  below. 

By  using  the  high-mean-count  linearization  we  know  that 


var(JV,)  *  4rjy4a(£,,N->2  ^  j  drA£1N1'(T) 
var(JV,')  *  4ip4(J<£orr>:  ^  J  dr.l£0L.TI'(r)  \ 


(53) 


(54) 


and 


COv(JV,,  JV,')  *  4ijAd(£ls)  (£olt)  (  drA£|Si'(rl 


X 


drA£0l  TI 


(55 ) 


( £is '  *  <  T ' 1  ■)£. '  =  I  (  T)P/hi  ) 1  (43)  where 


-l£lslx.  ii:£|Nix.[)-  £]\ ) 

*  7V  -a£.ix.  n  +  1 1  -  <T))‘  -£„<x.  t) 


+  Ufin  2 

T 

(44) 

i 1 

'  -■  1  -  i  T- 

)!  -<£si. 

(45) 

.IT,X-  n 

-  -U  -  'T 

i1  -a£.ix.  t) 

+  (T)'  - 

X  £mix.  tl 

+  ].ifit)/2ll 

-<T))'  -’] 

)£„•>.  (46) 

{  T ! 

5  1  -  |  vA„\ 

£i \'*htf(t  — 

r)dr 

=  1  -  i)/ts>'£ 

IS  ‘  * 7 a • 

(47) 

a£.,'(()  s  Ad~'  -  j  dx  Re[£  '(x.  t)  -  <£  lx.  1 1  ]  (56) 

for  J  =  IN.  OUT  represents  the  field-quadrature  fluctua¬ 
tions  that  beat  with  the  strong  mean  fields  to  produce  the 
photocount  fluctuations.  From  formulas  (44)  and  1 46 1— < 50 ) 
we  have  that 

A£|N1'(t)  =  |i?/(l  +  Xrrf)]'  '’a£s1( t)+  |rjArrf/(l  +  Ara|]‘  ■ 

X  £s|,(t)  +  (1  -  tj)  1  -£,n.  ,(t)  -  [T)P  hl  ' 

X  J  dr  A£|N|  ’(  t  )hdl  t  —  ti  1 57) 


and 


and 


-iTiri  '  -  |  2c1  :(£|N)P.e 


dx[i)'  :a£,n(x.  r) 


ll  -  TJ)1  -’£is  IX.  rl)Wi.((  -  r id r  <48l 


A£0,t,(()  =  -|tXrrf/(l  +  Xrj)]1  -^£<](t) 

+  [a/tl  +  Ara>] 1  -’£w,lf  I  +  (1  -  Jj)‘  • 

X  £,„  T(1(f)  -  [tlP'lti  ,A\rc)'  -’] 

X  drA£|S|')rl)i  .it - i.  1 58 1 


The  equation-  lor  the  mean  values  are  easily  solved,  yielding 
*  ',\P'hf,.Aj  |1  +  laPr.'/ii,  *]!*  149) 

=  -[i  P  hi  „.4  .  i  -  i  £.s  ■]'•'.  i  .Vi  i 

l r-  ir.  which  i:  roilow1-  that 


where  the  time-dependent  quadrature  operators  are  ob¬ 
tained  trum  the  corresponding  space-time-dependent  t:eld 
operators,  as  in  Eq  1  56 1  Equations  1 57 1  and  1 58 1  are  linear 
leedback  lorms  that  are  easiK  voiced  in  means  of  Fourier 
tran-iormation.  i  e  with 
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a£in,'(/1  =  dtA£|NI'(t|exp(— i2ir/(), 


etc.,  we  can  show  that 


Shapiro  er  o. 

the  variance  of  the  counts  missed  because  of  dead  time:  it  is 
super-Poisson  for  all  Xrd  values.  As  a  consistency  check  on 
formulas  (68)  and  (69),  we  note  that  they  implv  that 


•^^iNi’lft  = 


Ml  +  Xrd)]1/-[A£S1(/Q  +  (Xrd)l'2£i<,(/)1  +  (1  -  u)1 -£INV1(/) 


1  +  (*iP/hi/0)Hd(f) 


AGOUTI '<ft  ~  -(bXr^/d  +  Xrd)]'  2±£si(f) 

+  [rj/d  +  \Trf)]1/2£wl(/> 

-Ml  -  r,)1'-£0LT,1(/l 


tfd(ft  =  expt  —  nr/7d)[sin(  r/rd)]/jr/  (62) 

is  the  frequency  response  associated  with  the  impulse  re¬ 
sponse  h„(M.  It  is  now  a  simple  matter  to  use  the  coherent- 
state  quadrature-fluctuation  statistics  for  a£Si,  £.wi,  £ini-i. 
and  £ovTl  i  to  prove  that 

<A£,,'(f  +  r)ASkl'(t))  =  f  d/S£  £  .(/)exp(t2rr/r),  (63) 


for  7,  k  =  IN,  OUT.  where  the  noise  spectra  are  as  follows: 

S£ls,  £„,■(/)  =  1/4 1 1  +  (rjP/hi'n)Ha(f)\2,  (64 

(ft  =  4'1  +  l(riP/hi-r,lHd(ftl2/4Xrdll 

+  (r)P//lr0)/fd(ft!:,  (65 


^W.,lrT  'ft  =  -|-)P'hc„(XTd)'  :]/T*</)S£|s|£|n|(P.  (66) 

The  behavior  of  the  spectra  (Eqs.  (64)  and  (65)]  is  illustrated 
in  Fig.  7. 

To  complete  the  second-moment  analysis,  we  note  that 
the  high-mean-count  limit  requires  that  t  »  rd  so  that  only 
the  low-frequency  behavior  of  the  preceding  noise  spectra 
contribute  to  the  variance  and  covariance  in  formulas  (53)- 
<55i.  We  then  find 

varl.V.)  *  4|Xf/il  +  Xt0)]S£|i_  £  (0) 

=  Xf/U  +  X7dU  =  t.V.)/(  1  +  Xrd):,  (67! 

vari.V.)  ^  4|XtXr  /( 1  +  Xr)]S£  F  (0) 

J  a  1  ^OlTl  ^OUTl 

=  (Xt,\Td.'(l  +  Xrfl  t][  1  +  Xrd/(  1  +  Xrd)-] 

=  <X')|1  +  Xre.(l  -e  Xrd)2],  (68) 


Covi.V.  .V.  )  *  4|  \M  Xr  .)'  VI 1  -  Xrd)|5£  £  <0> 

=  \t.\- ..(]  +  Xr.i  <69  * 

Formula  16T i  reproduces  the  semiciassical  dead-time  result 
Eq.  t.‘.4  it  is  sub- Poisson  lor  ail  values.  Formulas  >6si 
anti  ",;n  are  novel  '  In  dead  time  nariance.  formula  ifivi  i> 


varl.V,  +  V,’)  =  varW,)  +  varf.V/)  +  2  cov(.V,, X,')  =  Xr.  (70i 

in  agreement  with  the  previously  stated  physical  argument 
that  -V,  +  N,‘  must  be  a  Poisson  process  of  rate  X.  Moreover, 
even  though  we  gave  a  quantum  version  of  that  argument.  .V. 
+  .V,'  being  a  Poisson  process  of  rate  X  also  follows  from  the 
semiciassical  shot-noise  descriptions.39 

Before  proceeding  to  the  NLFP  case,  some  elaboration  on 
what  has  just  been  shown  is  in  order.  Both  the  semiciassical 
and  the  quantum  theories  of  photodetection  predict  the 
same  mean  and  variance  for  the  DTMPP  in-loop  count  re¬ 
cord  .V,.  This  agreement  is  not  coincidental.  The  semicias¬ 
sical  statistics  will  be  the  same  as  those  of  the  quantum 
theory  in  any  closed-loop  photodetection  arrangement  in 
which  breaking  the  feedback  loop  leaves  the  photodetector 
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illuminated  by  a  classical  state  [in  the  usual  open-loop  sense, 
cf.  Eq.  (23)).  In  other  words,  despite  our  use  of  operator 
representations  for  photocurrents,  the  actual  quantum  mea¬ 
surement  occurs  at  the  photodetector,  and  the  feedback 
path  is  entirely  classical.  Thus,  if  the  open-loop  illumina¬ 
tion  does  not  require  the  use  of  quantum  photodetection 
theory,  then  neither  does  the  closed-loop  illumination. 
Conversely,  if  the  open-loop  illumination  is  nonclasaical, 
then  the  closed-loop  behavior  must  be  analyzed  quantum 
mechanically.  For  example,  suppose  the  Es  field  in  the 
DTMPP  analysis  is  in  a  broadband  squeezed  state  with  the 
mean  field  given  by  Eq.  (35)  and  low-frequency  first -quadra¬ 
ture  noise  squeezing40 

{or  W  -  1/TT<f  <71) 

with  7  «  1.  Equations  (67)  and  (68)  then  become 

var(N()  *  (N,)  [1  -  (1  -  7)rj/(l  +  Ard)]/(1  +  Ard)2  (72) 
and 

var(N'/)  a  <N/>[1  +  Ard/(1  +  Ard)2  -  f|Ard(l  -  y) 

X  (2  +  Xrd)2/(1  +  Ard)3],  (73) 

respectively.  For  Xrd  »  1  and  n  *  1,  the  out-of-loop  photo¬ 
counts  Nr'  are  strongly  sub-Poisson,  a  result  that  cannot  be 
obtained  semiclassicallv  as  N,'  is  an  open-loop  count  record. 

B.  Negative-Linear-Feedback  Process 

Our  second  example  of  closed -loop  photodetection  is  the 
N’LFP  whose  semiclassical  construct  is  sketched  in  Fig.  8. 
In  this  arrangement,  the  in-loop  photocurrent  i,  drives  an 
optical  intensity  modulator  through  a  causal  linear  time- 
invariant  filter  of  impulse  response  h/(t).  As  a  result,  for 
nonrandom  input  light  of  constant  power  P0,  the  optical 
power  emerging  from  the  intensity  modulator  obeys  the  neg¬ 
ative-feedback  law 

P.ll !.:  r  <  t|)  =  P„  -  <P,/q)  j  i.h,(t  -  r)dr,  (74) 

conditioned  on  knowledge  of  the  in-loop  event  history.  To 
ensure  that  Eq.  (74)  represents  negative  feedback,  it  is  suffi¬ 
cient  to  require  P,  >  0  and  h,it)  >  0  for  all  t.*1  Because 
P,(|i.:r  <  r|)  >  0  must  prevail  under  all  circumstances,  Eq. 
(74)  should  be  viewed  as  an  approximation,  even  under  the 
preceding  negative-feedback  conditions,  whose  validity  re¬ 
quires  that  the  feedback  term  on  the  right-hand  side  in  Eq. 
(74)  be  smaller  than  the  input  power  with  overwhelming 
probability.  It  is  interesting  to  note  that  the  DTMPP  is  a 
NLFP  with  h,(t )  =*  hd(f )  and  Pi  M  Pv.  For  the  DTMPP  no 
probabilistic  restriction  was  needed  to  guarantee  that  P,  >  0. 
but  stochastic  linearization  expedited  the  analysis.  In  what 
follows,  we  treat  the  NLFP.  with  an  arbitrary  negative  feed¬ 
back  hi(t).  using  a  similar  stochastic  linearization  both  to 
ensure  P  >  0  and  to  effect  steady-state  statistical  analysis. 
Moreover,  to  make  explicit  the  behavior  of  our  general  re¬ 
sults.  we  use  the  single-pole  filter 

exp'  — '  -  >  f  >  0 

/!.(('  =  ■  .  (75i 

o  otherwise 

a-  .i  running  example  We  treat  the  first-  and  second-mo- 
men:  -i.it ist ii  -  ni  i  he  in  - loop  phi >i i wurrent  ;•  and  the  out  -ot - 
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loop  photocurrent  t,'  using  the  semiclassical  and  quantum 
theories,  beginning  with  the  former. 

We  have  from  stationary -process  MCR  theory  [see  Eqs. 
(13)  and  (7))  that 

(i)  «  qtt'j  *  q  lim  [At-1  Pr(A N,  *  D]  (76) 

ai—o 

and  likewise 

(i'>  ■  q  lim  [At-1  Pr(AN,'  “  1)].  (77) 

41—0 

If  we  employ  iterated  expectation  on  the  right-hand  side  in 
Eq.  (76),  using  the  incremental  SEPP  description  [Eqs.  (3i- 
(5))  with  mi  “  niPt/h*o  for  <  the  beam  splitter's  intensity 
transmission,  we  find  that 

<i)  =  qij<  <P,)  /hi^  =  (qj)</hn0)[P0  -  (Pp'q)  <i)  ffyO)],  (78i 

where  H,{f)  is  the  frequency  response  associated  with  h.lti. 
Equation  (78)  can  be  solved  for  (i>,  yielding 

(i)  m  (q»j«P0/h»'0)/[l  +  (rjtP^/he^H/tO)}.  (79) 

which  reduces  to 

(*>  *  {q-mPJhvJHX  +  i\tPxTtlhvJ  (80) 

for  the  single-pole  filter  example.  In  a  similar  manner  we 
can  show  that 

(i')  =  qy(\  -  f)  (P,)  /hv0.  (81) 

Somewhat  greater  effort  is  required  to  deduce  the  second 
moments,  which  are  dealt  with  below. 

From  Eqs.  (14)  and  (7)  of  the  stationary-process  MCR 
theory,  it  follows  that  the  in-loop  photocurrent  covariance  is 
of  the  form 

cov(i,^„  i.)  =  q  (t>  6(t)  +  7f„(r  1,  (82) 

with  a  symmetric  nonsingular  component  given  by 

JijT)=qz  lira  |(At)“2  Pr(A.V(<.tAN.  =  l]i  -  <i):  (83 1 

41—0 

for  r^O.  By  iterated  expectation  and  the  incremental 
SEPP  description,  we  obtain 
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which  is  easily  reduced  to  the  following  Wiener-Hopf  equa¬ 
tion4-; 


Hi t  )  =  —  ( rjtPyhu.. 


Q  < 


i  >h,(r)+  j  H„(--s)h,is  Ids 


(Soi 


for  r  >  0.  With  the  same  approach,  we  find  that  the  out-of¬ 
loop  photocurrent  covariance  is 

cov(i(<..'.  i.'l  =  q(i)oir)  +  AlV(r),  (86) 

where  the  symmetric  nonsingular  component  A, .  satisfies 

■Ht  M\  =  -1^(1  -  ()P, />»-(,]  f  cov(ir<.r_„,  i.')h,(s)ds  (87) 

Jo 

for  t  >  0  and  the  nonsingular  cross  covariance  between  the 
in-loop  and  out-of-loop  photocurrents  satisfies 

|il  -  ()/<)7f11i7i  for  r  <  0 
[</l  1  -  ()]  'A, ,  (tI  for  r  >  0 


covii  i.'l 


(88) 


The  solutions  to  Eqs.  (82 >— < 88 1  are  not  difficult  to  obtain 
and  are  best  expressed  in  terms  of  the  noise  spectral  densi¬ 
ties  associated  with  the  photocurrent  covariances  and  the 
noise  cross-spectral  density  associated  with  the  photocur¬ 
rent  cross  covariance.  The  results  are41 


has  a  while-noise  spectrum  equal  to  the  sum  of  the  i:  and  ■ 
shot  -noise  levels 


VI’! 


(92 1 


whereas  the  in-loop  photocurrent  has  a  sub-shot-noise  spec  • 
trum  at  frequencies  for  which  1  +  ")(Pi-hi',,lW.t/)i  >  I.  and 
the  out  -of- loop  photocurrent  has  a  super-shot-noise  spec¬ 
trum  at  ail  frequencies.  These  characteristics  are  illustrat¬ 
ed  in  Fig.  9.  where  we  have  plotted  the  normalized  spectra 
5  *  f\;q •  i .  and  -S  i/i  u'  f  ’  for  the  single- pole  filter  example 
with  i  =  ■ . 

The  preceding  results  bear  on  the  recent  experiments  of 
Machida  and  Yamamoto-'1  and  Yamamoto  et  al.:i  These 
authors  used  a  GaAs  AlGaAs  injection  laser  diode  to  gener¬ 
ate  light  and  a  Si  P-l-N  photodiode  to  detect  it  Negative 
electrical  teedliack  from  the  detector  was  provided  to  the 
current  driving  the  laser  diode  Operation  in  a  configura¬ 
tion  analogous  to  Fig  v  with  <  =  ..  then  led  to  a  sub-shot - 
noise  in -loop  photoi  urrent  spectrum,  a  super-shot -noise 
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states,  showed  there  was  an  equivalent  semiclassical  formu¬ 
lation  for  the  experiments  reported  in  Refs.  20  and  21  Our 
preliminary  semiclassical  work  appeared  in  Ref.  16;  it  sepa¬ 
rates  the  feedback  loop  from  the  laser  source  by  employing 
an  external  intensity  modulator,  as  shown  here  in  Fig  8.  In 
what  follows  we  present  our  quantum  analysis  of  the  i.  and  o' 
statistics.  After  showing  how  Eqs.  (80 1 .  (81 1,  and  ' 89 1— ( 91 ) 
are  reproduced  quantum  mechanically,  we  contrast  their 
semiclassical  and  quantum  interpretations. 

Our  quantum  treatment  of  the  NLFP  parallels  our  quan¬ 
tum  DTMPP  development.  It  is  based  on  the  arrangement 
shown  in  Fig.  10,  in  which  a  coherent -state  signal  field  £*(x. 
t)  with  mean  given  by  Eq.  (35)  illuminates  in-loop  and  out- 
of-loop  photodetectors  through  a  lossless  modulated  beam 
splitter  and  a  lossless  ordinary  beam  splitter.  The  modulat¬ 
ed  beam  splitter  yields  both  an  in-loop  Field 

£tx,  t)  =  [nr)]12£s-(x.  r)  +  |l  -  Tit)]1  :£qix.  n.  ( 93 1 
in  terms  of  the  beam-splitter  transmission 

Tit)  =  (PJP )  -  iPy’Pqi  J  i.hAt  -  t id 7 .  < 94 1 

and  a  vacuum-state  field  operator  £.wtx.  t).  The  in-loop 
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and  out-of-loop  photocurrents  correspond  to  the  operator 
measurements.  Eqs.  (40)  and  (41),  respectively,  where  the 
effective  fields  £iN'  and  £orr’  are  obtained  from  Eq.  (42) 
and  the  ordinary  beam-splitter  relations 

£1N(x.  f)  =  <',2£ix.  t)  +  (1  -  <)‘  2£fl(x.  f).  (95) 

£orr(x.  r i  =  — ( 1  —  <)'  :£(x.  r)  +  fl/!£h(x.  rt.  (961 

where  yet  another  vacuum-state  field  operator  £g  is  intro¬ 
duced.  The  introduction  of  the  vacuum-state  fields  l£«,  £«) 
comes  from  the  necessity  of  preserving  open-loop  commuta¬ 
tor  brackets  after  propagation  through  the  beam  splitters 
when  the  feedback  path  is  absent.  The  vacuum-state  fields 
|£,ny,  £olt.  !  are  required  for  lossy  ( 77  <  1)  detectors  because 
of  the  fluctuation-dissipation  theorem,  which,  loosely  stat¬ 
ed.  implies  that  quantum  noise  is  injected  whenever  loss  is 
encountered,  again  because  of  commutator  conservation.45’ 
An  investigation  of  closed-loop  NLFP  commutator  behavior 


real-quadrature  fluctuations,  i.e. ,  those  in  phase  with  the 
mean  field. 

The  first  and  second  moments  of  1,  and  i,'  are  easily  ob¬ 
tained  from  Eqs.  (98M102).  With  the  linearized  formula¬ 
tion  we  know  that 

<h>  =  QAdu£,x'))- 

=  (qr)tPJhi>0)/[\  +  (TttPl/hi'a)Hf(0)]  (103) 

and 

<f/>  =  <?/!,.(  (£0l.T'))2 

=  [<?>r<l  -  e)P0/he0 ]/[l  +  (rtrP./Kl/tyO)],  004) 

in  agreement  with  the  semiclassical  results  [Eqs.  (79)  and 
(611).  Moreover,  by  introducing  Fourier  transforms  as  in 
Eq.  (591.  we  can  solve  the  first-quadrature  relations  in  Eqs. 
( 100)  and  (101),  with  the  following  results: 


A£,m'</>  = 


[p(  < r>r  +  IkO  -  (T))]1  -£^(f)  +  1»(1  -  <)]'  -£bi(^  +  q  -  I)1 -£,v,(fl 

1  +  (i)(P,/hi’0lH.l/) 


(105) 


will  be  of  interest  below-,  and  toward  that  end  we  introduce 
here  the  auxiliary  output-field  relation  for  the  modulated 
beam  splitter: 

£^rx(x.  n  =  -[1  -  Tit)]1  -£s(x.  m  +  [7'( r  )J 1  -£wix.  o. 

(97) 

At  this  juncture,  the  analysis  ran  proceed  rapidly.  Invok¬ 
ing  an  operator  linearization  of  Eqs.  I93M96).  as  was  done 
lor  the  DTMPP.  and  introducing  time-dependent  quadra¬ 
ture  operators,  in  a  manner  similar  to  Eq.  (5(1).  we  find  that 

(£lx  =  :n<P„  hr, .A  [1  +  (ipP  hi-j/f.lOlj!1  -.  (98) 

<£,„  =  -[)  1  -  r)  V[!  •’(£,s'>.  (99) 

-l£lv'( ' 1  =  >’  T  ] '  -a£..  (t )  +  |rj<(  1  —  <  T )  l)1  • 
x  £u  in  +  lud  -  d]1  :£H.(ti 

i-  I  1  —  n ) 1  '£, v  O'  _  i,  liJtP,  hr.  I 
X  j  a£|N|'i  r  1/1, If  -  rldr,  (1001 

and 

a£, ,!  T  it 1  =  -|ni  1  -  ( 1  <  T ■['  -A £,  tM  -  [iji  1  -  <) 
x  1 1  —  (T>)];  -'£,,. m  +  (n<)'  -’£„,(() 

'  1  -  1  ('  '£(11  T,  tn  *■  "I  [<t  1  -  <  >  1 1  ■ 

X  1  rP  'hr,  I  J  A£|\.  ”t  r  1/1 .( t  -  si.  (Kill 

In  Eii-.  1  P'o.  _md  1 !  1  >  1  ) 

T  =  'p  Pi  [1  -*-  "7 >P  h,  I H  mi]  Iluji 

.»  'In*  .iM-r.u*-  111* niul.ii (*f I  Ix-.i m - -pi : ! t er  iransnu»it>n.  and 
=  I.  J  f n ■  ■  ’ r  ’Mr  rc.iJ  am!  irn.iL'inun  tjuadrnune>.  re>pei - 
';\**r  -  : .  \"t .■  1  ::.u  : In- -rt-dhai  k  a! h*»!'-*ni\i!n- 


and 

a£01t,'(/1  =  -lull -<XT)|,'-a£>1(P 

-  [»)( l  -  f)(l  -  (T))]1  ’£„,(/)  +  (t,()‘ 

x  £h,(P 

(1  -  n)1  :£oit,  dA  +  [«1  ~  e )] 1  : 

x  ( ijP1/hi-„)A£|S|'(/’)H.(/3.  (106) 

From  these  solutions  we  then  obtain  the  quadrature  noise 
spectra  |see  Eq.  i6a)| 

Sf. _,si  (/)  =  1  4 1 1  +  (T)(Pt.hi  [t)H Afy~.  (10?) 

•Sl  1  (f)  =  4  1  +  <t  1  —  ( )  ’  l  tjP,  h  i\s)H A  f)  l-/4!  I 

+  ( t)<P,  hr,, )H. (fit-,.  (108) 

and 

=  l«l  -  ‘'I1  :")P;-h..,i  «  ‘(/'lNtivf.^lf). 

(109 1 

The  quantum  second-moment  derivations  are  completed  hy 
substituting  Eqs.  <  107i-<  109)  into  the  relations 

S,  1/1  =  4q  f,s  '/'■  1 1  lot 

.S  ,(/)  =  4 <J</")b'/„„,.f.illT  '/I.  ’ll!' 

and 

>’  1/1  =  -4c |  :  .'  1  ■>',  i,-’  'll: 

whnh  ti  How  trom  iinearizatton  \t  tornun.is  '’o' 

The  photm  tirrent  spectra  and  1  russ  spectrum  tho-  mien 
1  oitti  itie  w  ith  the -enm  lassic.ii  tormui.js  Eqs  ivi-  -1'. 
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system  When  the  feedback  path  to  the  intensity  modulator 
is  broken,  i,  and  are  statistically  independent  stationary 
shot-noise  processes  arising  from  the  independent  random 
atomic  excitations  occurring  in  the  two  detectors  under 
steady  classical  illumination.  Closing  the  loop  reduces  the  i, 
noise  level  at  frequencies  within  the  loop  bandwidth  as  in 
any  negative-feedback  stabilization  scheme.'"'  This  in-loop 
reduction  comes  about  by  modulating  the  incoming  light 
with  a  filtered  version  of  the  i,  shot  noise.  Insofar  as  the  out- 
of-loop  detector  is  concerned,  said  modulation  constitutes 
an  excess  noise  on  its  illumination  in  the  usual  open-loop 
sense.  As  a  result,  the  out-of-loop  photocurrent  shows  su¬ 
per-shot-noise  fluctuations  within  the  loop  bandwidth. 
Subtracting  the  normalized  photocurrents  precisely  cancels 
the  negative-feedback  noise  reduction  on  ft,  with  the  ran¬ 
dom-modulation  noise  increase  on  i,'  leading  to  a  sum-of- 
shot -noise  spectrum  for  i_. 

Quantum  mechanically,  the  feedback  loop  reduces  the  in¬ 
loop  photocurrent  spectrum  by  squeezing,  within  the  loop 
bandwidth,  the  quadrature  fluctuations  of  £is<x.  t)  that 
spatially  match  and  are  in  phase  with  its  mean  field.  Be¬ 
cause  of  the  r-rad  phase  shift  that  exists  between  the  rela¬ 
tive  phases  of  the  £  and  £*  contributions  to  £|\  and  £oit. 
which  is  a  consequence  of  energy  conservation.1,  the  nega¬ 
tive-feedback  loop  exacerbates  the  quadrature  fluctuations 
of  £0iT(x,  (l  that  spatiallv  match  and  are  in  phase  with  its 
mean  field.-11-;  As  a  result,  i-'  has  a  super-shot-noise  spec¬ 
trum  within  the  loop  bandwidth  When  the  photocurrents 
are  normalized  and  subtracted,  the  noise-current  operator  is 

Ai_lM  ^  f_(M  -  U.)  =  |il  -  fl/<)'  J-i;.  -  |</(1  -  <)]'  \ii.’ 

=  2(</t  1  -  <  i  <i ) -'A£|N.'(f  t 

+  2|(/(  1  .'  '  1  —  <  1 1  ‘  ‘.l£,  ii  T;  •<  1 

=  2  [  q  i  i  —  :  ’  )  1 1 1  ‘  [  1 1  —  f  i '  '  A£  |  s  |  <  f  I 

+  «'  -a£m,  ,  :inj 


and 

l£Sl(f).£S2<n]  -  <i/2)Mf  -  n.  di8i 

When  the  field  fluctuations  are  statistically  stationary,  the 
latter  imply  the  Heisenberg  uncertainty  limit 

sEs,EjnsE^jn  ^  me  ms) 

for  the  quadrature  fluctuation  spectra.  Coherent-stale 
light  achieves  the  minimum-uncertainty  product  in  expres¬ 
sion  (119)  with  equal  noise  strength  in  each  quadrature; 
squeezed-state  light  achieves  the  minimum-uncertainty 
product  in  expression  (119)  with  unequal  noise  strength  in 
each  quadrature. 

Consider  the  uncertainty  products,  similar  to  the  left- 
hand  side  member  of  expression  (119),  for  the  effective  fields 
£in '(/)  and  £ovr'U)  that  drive  the  in-loop  and  out-of-loop 
photodetectors.  We  already  have  expressions  for  the  first  - 
quadrature  noise  spectra.  The  following  second-quadra¬ 
ture  noise  spectra  can  be  deduced  immediately  from  Eqs. 
( 100)  and  ( 101 ): 

Sr  c  (f)  =  SE  E  (/)  *  1/4.  (1201 

<  tolT?tol-T7  < 

Equation  (120)  shows  that  both  second-quadrature  noise 
spectra  are  coherent-state  results.  Physically,  this  is  so 
because  the  linearized  intensity-modulation  feedback  does 
not  affect  the  second-quadrature  fluctuations  as  the  latter 
are  >r/2  rad  out  of  phase  with  the  mean  field.  Because 
Sf.ol.Tl  £mT1  if)  >  1/4.  we  see  that  the  free  field  £opt'  °bevs 
the  usual  uncertainty  principle  [expression  (119)].  On  the 
other  hand.  SfcIN1  £ini  (/)  <  1/4  prevails  within  the  loop  band¬ 
width.  so  that  £in  violates  the  free-field  uncertainty  princi¬ 
ple  If  we  use  the  frequency-domain  result  [Eq.  (105))  and 
the  corresponding  formula  for  \£  in/)/’),  we  can  easily  show 
that 

(£is'(r).  £,N'(/'l|  *  0  (121) 


=  2|i/i'i>  +  ;>)'  in.  (113) 

from  which  the  sum-of->ht>l-noises  formula  (Eq.  (92l|  fol¬ 
lows  immediately.-' 

C.  Commutator  Relations 

Additional  insight  into  closed-loop  photodetection  can  be 
developed  by  examining  the  field  commutators  for  the  quan 
turn  NLFP  configuration  shown  in  Fig  10.  For  notational 
compactness,  we  limit  our  discussion  to  spatiallv  integrated 
time-dependent  photon-unit  field  operators  of  the  form 

£jm  =  .-4,"'-i  dx£^ix.  n.  (114) 

.a. 

etc  Free  fields  of  this  tvpe  have  the  following  commutators: 

|£.in.  £.<r  ’!  =  o  < 1 1 5 > 


and 


1  £  a  m  .  £ . ' ' :  1  j  =  7i  /  —  :  i  1 1 1 0 1 

whu  h  .ire  equivalent  1"  the  quadrature  operator  commuta¬ 
tor- 


|£\  i  •  i.  £..i'  i  =  E.  in.  L  i|  =  n  <117. 


and 


l£|s  (().  £in"(CI|  =  jdf  1 <122) 
IN  <n  i  j  1  +(„</*,  h.  jHaP 

from  which  the  Heisenberg  inequality 


b£IV£,„<^/.ls,ils:</'t  2  1/1611  +  <»<£,  hi..//f .i/’i  •  (1231 


Ki_  !"  tju.iniiun  nh‘»titdrtp< ! n«n  i  i»ntitur,H i<  !  r  ■  *>•  \ !  K T  >  \ 
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follows  readily.  The  in-loop  effective  field  is  operating  at 
this  uncertainty-product  limit  at  all  frequencies. 

The  peculiar  nature  of  the  in-loop  effective-field  commu¬ 
tator  might  be  evidence  that  we  have  inadvertently  omitted 
a  quantum-noise  contribution  from  our  closed-loop  analysis. 
It  is  germane,  therefore,  to  examine  the  commutators  for  all 
the  field  operators  involved  in  the  arrangement  shown  in 
Fig.  10.  The  input  operators  l£sU),  £»U),  £b(0.  £|Ni(tl, 
£outyUH  are  commuting  free-field  operators  obeying 

I£,(t),  £,■<:')!  -0  (124) 


l£,(f).£;’in]  =  bjjMt  -  n  U25) 

for  j,j‘  =  S,  M,  B,  INf,  OUTc.  Whether  or  not  the  feedback 
loop  is  closed,  the  open-loop  output  operators  |£alx(H, 
£oitUI,  £oct'(H|  are  free  fields  whose  commutators  must 
also  satisfy  Eqs.  (124)  and  (125).  That  they  do  so  can  be 
demonstrated  by  means  of  Eqs.  (93)-<97)  and  the  lineariza¬ 
tion  procedure  used  to  derive  the  quadrature-noise  spectra, 
without  recourse  to  additional  quantum-noise  sources. 

The  in-loop  field  operators  |£(t),  4£|y(f),  £in'U)!  are  not 
free  fields  when  the  loop  is  closed,  and  thus  they  need  not 
have  the  usual  commutators.  We  have  already  found  the 
£in  (t)  commutators.  By  similar  calculations  we  obtain 

[£u>.  £<nl  =  [£IN(t),  £ls<n)  =  o  U26) 

and 

[£(m,  £’(f))  =  [£ls(f),  £[N’(t')) 

=  jdf  _c°sl2-/u  -  n|  (12T) 

I  1  +  (ij<P1/hi„)H;(/’) 

i.e..  all  three  in-ioop  fields  share  the  same  non-free-field 
commutator  behavior. 

The  special  character  of  the  in-loop  commutators  [Eqs. 
(1221  and  ( 127)]  is.  we  believe,  the  hallmark  of  the  quantum 
closed-loop  theory.  It  permits  the  in-loop  photocurrent  to 
have  a  sub-shot-noise  spectrum,  which  is  the  semiclassical 
signature  of  the  closed  loop,  even  though  all  the  free-field 
input  operators  are  in  coherent  states.  Indeed,  were  we  to 
have  (£|N  (n.  £ts'(t')]  =  0  and  |£]N'(M.  £|N’(t')]  *  6(t  —  ('), 
with  £is(()  being  a  linear  combination  of  only  the  coherent - 
state  field  operators  (£.irl,  £«(().  £/,ir).  £iN,(f)[,  i,  would 
have  to  be  a  DSPP  Its  spectrum  could  then  never  reach 
sub-shot-noise  levels. 

The  preceding  discussion  confers  a  special  closed -loop 
status  on  the  in  loop  fields  [£((),  £inW  ),  £ix'(t)]  in  the  appa¬ 
ratus  shown  in  Fig.  10.  This  extraordinary  behavior  can. 
nevertheless,  be  reconciled  with  the  obvious  facts  that  £(x. 
f)  represents  a  field  in  the  free-space  region  between  the 
modulated  and  ordinary  beam  splitters  and  £in(x,  1 1  repre 
sents  a  find  in  the  free-space  region  between  the  ordinary 
beam  splitter  and  the  in-loop  photodetector.  The  key  is 
optical  propagation  dela\. 

Our  entire  closed-loop  analysis  is  <>!  a  lumped-element 
charaiter.  viz.,  no  propagation  delay  whatsoever  is  included 
between  the  op'ical  elements  ot  the  configuration  of  Fig  In 
It  . -  well  known  that  de.j\  in  j  classical  leedback  loi.piun 
stronglv  otieit  us  perlorm.ime  Let  us  see  what  etfi-ci- 


optical  propagation  delay  has  on  the  quantum  NLFP  statis¬ 
tics  and  field  commutators. 

Suppose  that  there  is  a  Tp-sec  propagation  delay  between 
the  modulated  beam  splitter  and  the  ordinary  beam  splitter 
and  that  there  is  no  propagation  delay  between  the  ordinary 
beam  splitter  and  the  in-loop  and  out-of-loop  photodetec¬ 
tors.  In  place  of  Eqs.  (95)  and  (96)  we  then  have 

£,n(x.  t)  =  t1/2£(x,  t  -  rp)  +  (1  -  <)1/2£B(x,  f ),  (128) 

£olt<*.  *>  *  -<1  -  <)1/2£(x,  f  -  rp)  +  (1/2£b(x.  t).  (129) 

Mimicking  the  development  of  the  no-delay  case,  we  can 
show  that  the  mean  photocurrents,  the  photocurrent  spec¬ 
tra.  and  the  cross  spectrum  all  take  the  forms  given  previous¬ 
ly  with  H/if)  replaced  by 

Hfif)  =  exp(  -  i2-fTp)H,(f).  (130) 

Thus  the  physical  discussion  concluding  Subsection  3.B  con¬ 
tinues  to  apply,  insofar  as  moment  behavior  is  concerned, 
subject  to  the  impact  of  the  delay  factor  on  the  right-hand 
side  of  Eq.  (130)  on  the  achievable  noise  squeezing.  Said 
impact  is  illustrated  in  Fig  11,  where  we  have  plotted  S„(/)/q(i) 
for  the  single-pole  filter  example  with  <  =  %  c/Vj/heo  *  8. 
and  various  Tp/r^  values.48 

Now  let  us  consider  the  commutator  behavior  when  opti¬ 
cal  delay  is  included.  Here  we  find  that  the  open-loop  out¬ 
put  operators  |£acx(H.  £orr(H,  £oct'(0|  continue  to  have 
the  free-field  commutators  [Eqs.  (124)  and  (125)],  whereas 
the  in-loop  field  operators  |£lr),  £ ( N ( r ) ,  £|N'(f)|  have  the 
commutators  given  in  Eqs.  (121),  (122).  (126).  and  (127)  with 
H,(f)  replaced  by  H/if)  from  Eq.  (130)  If  we  now  write 

[d  cos[2ir/((  —  f')| 

I  1  + 


=  Ht  -f')-  d f 


(rj<P|/hi'.,)W,'(/)cos(2ir/(f  —  f'l] 

1  + 


we  can  use  the  causality  of  ff. if)  to  prove  that  the  integral 
term  on  the  right-hand  side  in  Eq.  (131 1  is  zero  for  b  -  r'i  < 
rp.  Thus,  for  time  differences  smaller  than  the  optical  prop¬ 
agation  delay,  the  in-loop  fields  have  free-field  commutator 
behavior.  Physically,  this  means  that,  at  anv  time  i.  the 
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fields  present  in  the  free-space  regions  between  the  modu¬ 
lated  beam  splitter  and  the  ordinary  beam  splitter  and  be¬ 
tween  the  ordinary  beam  splitter  and  the  in-loop  photode- 
tector  have  no  special  closed-loop  status.  It  is  the  feedback 
action  over  time  intervals  in  excess  of  the  optical  propaga¬ 
tion  delay  that  leads  to  the  non-free-field  forms  in  Eqs.  <  1 22 > 
and  (127).  These  interpretations  can  be  tested,  conceptual¬ 
ly.  by  studying  the  transient  behavior  of  the  in-loop  photo- 
current  statistics  when  the  feedback  path  is  suddenly  bro¬ 
ken.  If  the  field  that  exists  at  any  one  time  between  the 
modulated  beam  splitter  and  the  in-loop  detector  is,  in  es¬ 
sence,  a  classical-state  free  field,  then  breaking  the  feedback 
path  at  time  f0,  when  the  loop  was  in  its  steady  state,  must 
make  jt,:f  >  to!  a  DSPP.  The  proof  is  as  follows. 

Suppose  that  the  closed-loop  system,  incorporating  the 
rp- sec  propagation  delay,  is  in  its  statistical  steady  state  at  t 
=  to  and  that  the  feedback  path  is  broken  by  freezing  the 
modulated  beam  splitter's  intensity  transmission  for  t  >  t„. 
i.e..  by  forcing 

Tin  =  {T i  T>  /Phi  ,,)1  ;  j  A£1M'<T»h.'ir„  -  rldr 


to  prevail  for  t  >  r,„  where  h,'(t\  is  the  impulse  response 
associated  with  H-'tf)  and  ( T)  is  still  given  by  Eq.  (102).  We 
then  find  that  Eq.  (IOOi  becomes 

a£,s  'It )  =  |ijf  ( T)]1  :x£>,U  —  rpl  +  (rjd  1  —  ( T i )] 1  " 

X  £V:(r  -  7p)  +  (|)1 1  -  cl]1  -£ft,U  I 

+  ( 1  -  rj) 1  -£,S[  .Irl  —  <)..l(r)<P1//ii'0l 


A£,..  ,’irlh.  (I, 


for :  2  :,,.J  =  1.2.  Using  the  remarks  surrounding  Eq.  ( loll, 
we  can  show  from  Eq.  i  Idol  that 
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commutators  for  all  times  and  hEFrit)  is  the  impulse  re¬ 
sponse  associated  with  the  frequency  response: 

HKyr(f)  - - ‘  —  -  •  (138) 

1  +  (ncPt/ht>0)H ,  (f) 


|£,n'i;i.£,n'iU'I  =(> 


So.  because  h^FFd)  is  zero  for  t  <  tp  |cf.  Eq.  (131)].  the 
integral  term  multiplying  on  the  right-hand  side  of  Eq. 
(136)  can  be  replaced  with  a  classical  zero-mean  real-valued 
Gaussian  random  variable  of  variance  4~l  J df\H eff*/"'  This 
demonstrates  that  differs  from  the  coherent-state 

field  operator  a£Eff «)  on  t  >  to  only  by  a  c-number  random 
variable.  Hence  |£[N(f):f  >  tol  is  in  a  classical  state,  and  !i,:t 
>  1 0|  is  a  DSPP. 

4.  NONCLASSICAL  FIELD  EXTRACTION 

In  this  our  concluding  section,  we  address  the  use  of  nonclas- 
sical  light-beam  correlations  to  extract  nonclassical  open- 
loop  fields  from  closed-loop  photodetection  systems. 

A.  Dead-Time-Modified  Poisson  Process/Photon-Twins 
Field  Extraction 

Parametric  downconversion  and  atomic-cascade  emission 
processes  both  yield  the  photon-twin  light  beams  that  were 
suggested22  23  and  recently  used25  for  producing  open-loop 
sub-Poisson  light  by  gating  procedures  akin  to  the  arrange¬ 
ment  that  we  have  dubbed  the  DTMPP.49  A  basic  structure 
for  such  experiments  is  shown  in  Fig.  12.  The  photon-twin 
source  produces  two  spatially  disjoint  quantum  fields  that 
illuminate  identical  in-loop  and  out-of-loop  photodetectors 
through  flip-mirror  arrangements  driven  by  the  in-loop  pho¬ 
tocurrent  in  the  manner  of  Fig  5.  In  parametric  fluores¬ 
cence.  the  photon-units  fields  £kkf<x.  n  (center  frequency 
i'ukf*  and  £-it.(x.  tl  (center  frequency  rsn;)  are  each  in  clas¬ 
sical  states  ";  yet.  because  of  energy  conservation  at  the 
photon-generation  level,  they  have  perfect  photon-flux  cor¬ 
relation.  namely. 

var[ ;  dx|£.M(i'<x.:)£.,|l;(x.  n  -  ?)£KKr(x.  f)]j  =  0. 


|£1N  m.  £in"'iC>]  =  <■>(/  -  n  (135) 

hold  for  !.  ('  >  t.  Thus  the  effective  photon-units  field 
driving  the  in-ioop  photodetector  for  the  t  >  t„  transient 
regime  has  tree-field  commutator  behavior.  To  prove  !t,:f  > 
t,.l  is  a  DSPP.  we  need  only  show  that  l£| g'(t):t  >  t,,l  is  in  a 
classical  state. 

Using  our  frequency-domain  solution  to  the  steady-state 
feedback  form  (Eq.  i  IOOi  with  deiav  included],  we  can  show 
that 

2l£(n  I  / 1  =  Jl£ y  y y  ( t )  7 ,  f  XE ff; 1 7  lh fff  ( (  r)dr 


for  =  1.  2  and  :  >  where 
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This  correlation  is  exploited  by  using  photodetections  from 
the  reference  field  to  gate  both  the  reference  and  the  signal 
fields,  resulting  in  a  sub-Poisson  out-of-loop  photocount 
record,  as  will  be  shown  below. 

Let  us  assume  that  the  detection  area  Ad  comprises  a 
sufficiently  large  number  of  spatial  modes  of  th  ■  parametric 
fluorescence  that  the  low-photon  coherence  condition01  ap¬ 
plies.  We  can  then  linearize  the  in-loop  and  out-of-loop 
effective  photon-flux  operators  about  their  mean  values  and 
show,  using  techniques  similar  to  those  in  Subsection  3.A, 
that 
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correlation  permits  the  photon-flux  behavior  of  the  signal 
field  to  be  inferred  from  a  homodvne-detection  phase  mea¬ 
surement  on  the  probe  field.  This,  in  turn,  permits  the 
feedback  loop  to  be  closed  by  using  the  probe-field  phase 
measurement  instead  of  the  signal-field  direct-detection 
measurement.21 

To  make  the  preceding  remarks  explicit,  let  the  input  signal 
and  probe  fields  have  strong  mean  values 


<£,(*.  t)>  =  (P./hrAj' 


(144) 


<N,)  =  <.V/>  =  \t/(l  +  Xrd) 
and 

vari.V.)  =  <iV,)/(l  +  \rd)i. 


(140) 


(141) 


vari.V)  =  (,\V>|1  +  (1  —  rj)2.\-ra(  1  +  Xtu)]/(1  +  \Td)2.  (142) 

Here.  X  =  tiPref^^ref  =  nPs in^csic  gives  the  average  open- 
loop  photoemission  rate  in  terms  of  the  average  fluorescence 
power  P  divided  by  the  photon  energy  hv  at  the  reference 
and  signal  frequencies,  and  the  detector  quantum  efficiency 
17  was  assumed  to  be  the  same  at  both  of  these  frequencies. 
When  \rd  »  1  the  out-of-loop  photocount  record  will  be  sub- 
Poisson  with  a  variance  limited  by  the  imperfect  effective 
photon-flux  correlation  caused  by  thequantum-mechanical- 
ly  independent  9  <  1  quantum  noise  introduced  at  the  in¬ 
loop  and  out-of-loop  detectors,  viz.. 


vari.V.')  *  (,V.')2(1  -  v). 


(143) 


Production  of  strongly  sub-Poisson  open-loop  light  by  using 
the  DTMPP  photon-twins  route  will  therefore  require  near- 
unity  photodetector  quantum  efficiencies  at  both  nu:F  and 


B.  IVegafive-Linear-Feedback  Process/Quantum 
Nondemolition  Field  Extraction 

Yamamoto  ot  al.-1  suggested  that  a  Kerr-effect  QND  mea¬ 
surement-1  be  used  to  extract  a  sub-shot-noise  open-loop 
field  from  their  teedback-modulated  semiconductor-laser 
NLFP  system.  They  also  showed-’1  14  how  the  back  action  of 
the  QND  measurement  increases  the  second-quadrature 
1  phase  1  noise  on  the  extracted  sub-shot-noise  out-of-loop 
field,  imparting  to  this  field  sufficient  quantum  noise  to 
make  its  commutators  take  on  the  requisite  free-field  char¬ 
acteristics.  These  results  can  easily  be  reproduced  in  our 
NLFP  construct,  wherein  the  modulation  is  external  to  the 
laser,  as  sketched  out  below.  The  behavior  of  the  semicon¬ 
ductor  laser  itself  plays  no  part  in  nur  treatment. 

Consider  the  NLFP'QND  arrangement  shown  in  Fig.  13. 
In  this  setup,  two  strong  coherent-state  fields,  the  center- 
trequencv  signal  field  £,ix.  /).  and  the  center-frequency 
1 ,,  probe  field  1  x. 1  are  the  principal  inputs.  The  former 
transits  3  modulated  beam  -plitter.  in  the  manner  ot  the 
ordinary  i  Fig  lt)i  NLFP  experiment,  before  interacting  with 
the  probe  field  111  the  Kerr  medium.  The  cross- phase- mod 
illation  interaction  .  hi  or  ring  over  an  Lm  pat  h  in  t  hat  medi 
urn  engender*  omnLi— u  al  coupling  between  the  photon 
flux  den*it\  ot  the  <  enter  1  requencv  1  -  signal  field  £.  ,1  ■  <  x  1 
iha-e  -  -  t  ?  lie  <  enter  ;  requetn  \  -  probe  1  leld  /: ;  -,1  \ 

-how  n  b\  lm.it..  .-'  nl  .  '  t hi-  qu.111f.1n. 


mu  ’  tie  [ifia-e  - 
'  ’  :  hat  emerge 


and 


for  j  =  S,  P,  where  Ad  will  be  regarded  not  only  as  a  photode¬ 
tector  active  area  but  also  the  cross  section  over  which  the 
Kerr  interaction  transpires.  We  neglect  self-phase  modula¬ 
tion.  loss,  and  dispersion  and  assume  that  the  phase  shifts 
produced  by  cross-phase  modulation  are  very  small  com¬ 
pared  with  a  radian.  In  terms  of  the  spatially  integrated 
fields  we  then  have 

£ovrU)  =  |1+  /»£[(  Pp/hi'p)  +  2  IPpJhFp)'’- 

X  A/ipjff  —  rp)]\£{t  —  rp)  (145) 

£1N(t)  =  II  +  UL[(£)-Ad  +  2  (£)AdU2 

X  a£,U  -  rp)]|£p(f  -  rp),  (146) 

where  we  have  linearized  about  the  strong  mean  fields. 
Here.  1  is  the  cross-phase-modulation  coupling  constant,  rp 
=  Lie  is  the  propagation  delay,  and  subscript  1  denotes  first 
field  quadrature.  The  homodvne  measurement  on  the  in¬ 
loop  probe  field  £|N  is  arranged  to  sense  the  second  field 
quadrature.  The  operator  representation  for  the  resulting 
homodyne  photocurrent  is  then4 

1,  =  2qniPt.o/bi',.)1  •’KLtP/v'hiq.)'  •’(<£)■/!„ 


+  2<£'.4„'  2a£,U  -  T  )]  +  £,,:lt  -  r p )  1 


+  1, 


it). 
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where  Pui  is  the  local-oscillator  power  and  i,al  is  a  zero-mean 
while  Gaussian  noise  process  of  spectrum  q-r;i  1  -  ilPuifbep 
that  represents  the  17  <  1  quantum  noise  incurred  in  the 
homodyne  apparatus.  The  feedback  loop  is  closed  by  pass¬ 
ing  i.  through  an  amplifier  of  gain  K  =  hi')>l2>.L\PioPp'i‘  : 
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and  the  causal  linear  filter  q~]h/(t).  From  this  point  on,  it  is 
straightforward  to  derive  the  following  field-quadrature  and 
photocurrent  fluctuation  spectra  and  commutators  for  the 
out-of-loop  signal  field  £on<t): 

'^OCTl^OlTl^ 

1  +  \nPlH'[f)/hv-\-!iuL)HT,^PPthvP)yPsthi'S) 

=S  - 1 — - - - — — — » 

4ll  + 

(148) 

^ EoL-nEovrf-fi  =  4  '|1  +  4 UL)'(T)  (Pp/h»p)(Ps/htis)], 


(149) 

S,,(/)  =  q(n\4nSEo^Eo^(f)  +  (1  -  I))], 

(150) 

[£oiT(fl-  =  0. 

(151) 

^OCT  ^  *1  =  5R  —  f  ). 

(152) 

where  < T >  =  (P,,/'P>)/[1  +  (vP\/hi’<)H,'(0)]  is  the  mean 
beam-splitter  transmission. 

We  see  from  Eqs.  (148 1  and  (150)  that  when  the  probe 
power  is  sufficiently  large  the  free-field  output  £out  will  be 
first-quadrature  squeezed,  within  the  loop  bandwidth,  and 
yield  a  sub-shot-noise  out-of-loop  photocurrent  spectrum 
over  this  frequency  range.  We  see  from  Eq.  (1491  that  the 
second -quadrature  (phase)  noise  on  this  output  beam  has 
been  raised  above  the  coherent-state  level  by  the  photon- 
flux  (first-quadrature i  probe  noise  introduced  through  the 
Kerr  interaction  [cf.  Eq.  (145)].  Moreover,  this  back  action 
exactly  reclaims  the  appropriate  free-field  commutator  rela¬ 
tions  (Eqs.  ( 151 )  and  ( 152))  and  therefore  forces  the  uncer¬ 
tainty  product  obtained  from  Eqs.  ( 148 (  and  ( 149i  to  satisfy 
formula  i119i.  Indeed,  if  we  adjust  the  probe  power  Pr  to 
minimize  this  unceriaititv  product  at  a  selected  frequency  /,.. 
we  get 

min  i-S,  l  (/, (S.  ,  i/.  ii 

1 1  +  n'  'P  ,H, ’(/,,)  h i , I ■ 

= - - - - r  >  1/16.  (153) 
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C.  Quasi-State  Synthesis 

The  overwhelming  majoritv  of  our  analvsis  of  closed-loop 
photodetection  has  dealt  with  linearized  feedback  loops. 
We  close  by  returning  briefly  to  the  general  nonlinear  con¬ 
struct.  Our  goal  is  to  show  how  the  preceding  closed-loop 
field-extraction  techniques  lead  to  an  in-pnnciple  synthesis 
procedure  for  producing  an  open-loop  quantum  light  beam 
ut  arbitrary  prescribed  direct -detection  i  pholooount  (  statis¬ 
tics. 

Consider  the  open-loop  photodetection  of  a  quantized 
litnt  oeain  using  a  detector  ot  unite  quantum  efficiency  If 
the  densite  operator  iMute*  oi  thi-  beam  is  totally  uncon- 
--trained  by  previous  inlormation.  then  the  resulting  photo 
detection  event  time-  will  comprise  a  similarlv  uncon- 
-trained  *>E1‘! ’  Now  ( i insider  the  ciosed -  loop  photodetec  - 
tioit  -v-tem  :::  K:g  w  rtere  once  again  mute  quantum 

eliiiienc\  i-  .is-umeii  Smipose  ihat  the  input  light  is  ,i 
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highly  intense  beam,  and  the  optical  modulator  is  an  infi¬ 
nite-bandwidth  electro-optic  intensity  modulator  driven  by 
an  arbitrary  causal  time-varying  nonlinear  system  run  from 
the  photocurrenL  Then  this  photocurrent  can  also  be  an 
arbitrary  SEPP^  In  other  words,  using  the  semiclassical 
closed-loop  theory,  which  is  valid  when  the  input  light  is  in  a 
classical  state,  we  can  synthesize  any  SEPP  by  choosing  the 
feedback  function  to  be  such  that  the  power  falling  on  the 
photodetector  satisfies52 

[P,(|tf,Nf)))/h,0  =  Mf,  (1541 

where  u,  is  the  conditional  rate  function  for  the  desired 
SEPP.  By  this  technique,  we  can  use  coherent-state  light  to 
create  a  closed-loop  beam  whose  direct-detection  statistics 
match  those  of  any  desired  (classical  or  nonclassical  state  i 
open-loop  beam.  It  should  be  clear  from  Subsections  4.A 
and  4.B  and  IV. B  that  quantum-beam  correlations  of  either 
the  photon-twins  or  Kerr-effect  QND  variety  will  permit 
extraction  of  said  closed-loop  field,  leading  to  the  aforestat- 
ed  synthesis  procedure  for  open-loop  beams  of  arbitrary 
prescribed  direct-detection  statistics.  It  should  be  noted, 
however,  that  open-loop  direct-detection  statistics  do  not 
uniquely  determine  the  density  operator  of  a  quantum 
field,5  so  we  cannot  say  that  the  procedure  that  we  laid  out 
can  synthesize  arbitrary  prescribed  field  states.  Such  a  syn¬ 
thesis  procedure  may  perhaps  be  attainable  from  closed  - 
loop  heterodyne  detection,  inasmuch  as  it  is  known  that 
open-loop  heterodyne  detection  statistics  do  determine  the 
density  operator.5 
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Abstract.  The  photocounting  statistics  of  shot-noise  light  have  been  studied 
extensively.  Eight  exhibiting  such  statistical  properties  arises  in  photon- 
generation  processes  that  involve  a  two-stage  cascade  of  Poisson-based  events, 
such  as  cnthodolumincsccnce.  Expressions  for  the  photocounting  distributions 
are  complex  because  they  depend  on  the  overall  mean  of  the  distribution,  the 
photodelector  counting  time,  and  the  spectrum  of  the  light  A  simple  two- 
parameter  distribution,  the  Ncvmaii  Type-A  (N’l'A),  is  shown  to  provide  an 
excellent  approximation  to  the  photocountmg  statistics  of  shot-noise  light  with 
arbitrary  spet  ti  al  properties.  The  NT.A  distribution  therefore  plays  the  same  rule 
for  shot  -noise  light  that  the  negatix  e- binomial  distribution  plays  for  chaotic  light. 


1.  Introduction 

The  shot-noise-dri  veil  doubly  stochastic  Poisson  point  process  (SN’DP)  [1,2] 
and  the  Thomas  point  process  (TIM1)  [3|  arc  usclul  for  describing  sequences  of 
events  that  arise  from  a  cascade  of  two  Poisson-based  point  processes,  when  the 
temporal  ellects  of  the  cascading  process  are  taken  into  account.  These  point 
processes  have  found  use  in  a  variety  of  fields,  including  optics  |1  8),  vision 
[3,6,  9-1  1],  and  other  disciplines  ranging  from  biological  information  transmission 
to  astrophysics  [1,3,4,  12  I  4 1. 

The  SNDP  is  formed  f  rom  a  two-stage  Poisson  cascade,  in  which  each  event  of  a 
primary  Poisson  point  process  produces  a  virtual  inhomogeneous  rate  function, 
w  hich  in  turn  is  the  source  of  a  secondary  non-stationary  Poisson  point  process.  The 
secondary  processes  tire  superimposed  to  form  the  overall  (stationary)  point  process. 
Tlie  SN  DP  is  a  doubly  stochastic  Poisson  point  process  (DSPP);  it  is  also  a  special 
case  of  the  Neyman  Scott  cluster  process  [1,12].  The  primary  events  are  excluded 
from  the  overall  pom!  process  in  the  SNDP;  the  TPP  is  distinguished  only  by  the 
inclusion  of  the  primary  events.  It  hits  been  shown  that  the  SNDP  counting 
distribution  reduces  to  the  simple  Neyman  Type-A  ( NT  A )  1 1 , 4,  1  5]  w  hen  cascading 
occurs  instantaneously;  the  TPP  counting  distribution  similarly  reduces  to  the 
Thomas  (which  is  similar  to  the  NTA)  in  the  absence  of  time  dynamics  |3,  16) 

In  the  context  of  optics,  the  theory  has  been  used  to  characterize  the 
photocounting  statistics  of  shot-noise  light.  Light  exhibiting  such  statistn.il 
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characteristics  arises  when  the  photon-generation  process  involves  a  two-stage 
cascade  of  Poisson-based  events.  Examples  include  cathodoluminescence,  optical 
fluorescence  initiated  by  ionizing  radiation,  and  X-ray  radiography.  In  processes 
such  as  these,  both  the  impinging  excitations  (primary  events)  and  the  photon 
emissions  (secondary  events)  are  often  endowed  with  Poisson  statistics.  The 
presence  ol  Cerenkov  photons  can  be  accommodated  by  including  the  primary 
cv  cuts  [  3  |. 

Unfortunately,  expressions  for  the  photocounting  distributions  associated  with 
shot-noise  light  are  complex  because  they  depend  on  the  overall  mean  of  the 
distribution  <[n),  the  counting  time  T  and  a  function  li(t).  This  function  is  an 
inhomogeneous  rate  that  describes  the  time  course  of  secondary-event  production 
upon  the  arriv  al  of  a  primary  event,  and  it  determines  the  spectrum  of  the  light.  The 
principal  parameters  that  characterize  //(/)  arc  the  average  number  of  secondary 
ev  ents  per  primary  event  i  (the  area  of  //(/))  and  the  characteristic  secondary-ev  ent 
cluster  time  rp  (the  width  of  //(/)). 

The  purpose  ot  this  paper  is  to  investigate  the  validity  of  the  Ncyman  Typc-A 
distribution  as  an  approximation  for  the  photocounting  statistics  of  shot-noise  light 
w  ith  arbitrary  spectrum.  1 1  is  a  simple  distribution  characterized  by  two  parameters: 
its  mean  (>/,>  anil  a  multiplication  parameter  a.  'Die  choice  of  this  particular 
distribution  follows  f rom  theoretical  studies  demonstrating  that  the  SNDP  and  TIM* 
counting  distributions  approach  the  NTA  in  various  limits  [1,3].  Our  principal 
interest  is  in  the  region  [I  =  77ir s:  I,  where  the  specific  form  of  h(l)  is  likely  to  be 
important.  'The  counting  time  T  is  then  of  the  order  of  the  characteristic  splay-out 
time  of  the  secondary  pulses  ( rp >  so  that  the  validity  of  the  approximation  must  be 
determined  numerically.  'Die  NTA  approximation  turns  out  to  be  excellent  when 
h( I )  assumes  two  common  f  orms  ( rectangular  and  exponential),  for  all  regions  of  the 
parameters  2  and  /I.  Our  results  suggest  that  the  approximation  is  probably  quite 
suitable  for  arbitrary  /;(/),  2  and  //. 

The  approach  w  e  adopt  is  similar  to  that  used  by  Bedard,  Chang  and  Mandel  [  1  7] 
aiul  by  Perina  and  his  coworkcrs  [IS  2  I J.  'Die  former  study  dealt  with  the  use  of  the 
negativ  e-binomial  distribution  as  an  approximation  for  the  photocounting  statistics 
of  chaotic  light  with  arbitrary  spectrum.  The  latter  studies  were  concerned  with  the 
use  of  the  non-central  negative  binomial  (Laguerre)  distribution  as  an  approxim¬ 
ation  lor  thi'  photocounting  statistics  of  mixed  coherent  and  chaotic  light  [22], 


2.  The  NTA  counting  distribution 

The  NTA  distributed  random  variable  n,  of  mean  (ri)  and  multiplication 
parameter  a  (average  number  of  secondary  ev  ents  per  primary  event),  has  a  moment¬ 
generating  function  given  by  [1,15] 


yn(.v)  =  <exp(  -  5//);  =  exp 
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Its  variance  is 
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Tin-  usual  expression  for  the  N’l'A  counting  dist rihut ion  p(n }  takes  the  form  of  an 
infinite  series  [4],  with  a  recurrence  relation  of  the  form  |l] 

"  a1 

(;/+  !)/>(»  4-  !)  =  <//>  V  exp  (  —  a)  —  p(n  —j). 


/>(«)  =  exp  (exp(-«)-  l)J. 

When  (/—•(),  (I)  (3)  reduce  to  expressions  for  the  Poisson  distribution. 


3.  Comparison  of  the  NTA  and  SNDP  counting  distributions 

The  NTA  is  considered  a  candidate  for  approximating  the  SNDP  and  TPP 
counting  distributions  because  it  is  known  from  theoretical  arguments  that  it 
approaches  these  complex  counting  distributions  in  important  limits.  For  vanish¬ 
ingly  small  a,  or  for  vanishingly  small  /(  and  /fa,  it  has  been  shown  that  the  SNDP 
counting  distribution  approaches  the  Poisson  distribution  for  arbitrary  h(l)  [1,2]. 
This  is  because  the  secondary  pulse  clusters  are  then  either  very  sparse  (a «  1 )  or  are 
cut  apart  by  the  small  value  of  7’ (//«!).  Since  the  NTA  approaches  the  Poisson 
distribution  for  vanishingly  small  a,  it  certainly  is  a  good  approximation  to  the 
SNDP  in  this  region.  At  the  opposite  extreme,  when  /f »  1 ,  the  SNDP  counting 
dist  rihut  ion  also  approaches  the  NTA  for  at  bit  rare  /;(/)  [1,2].  This  is  because  all  ol 
the  secondary  pulses  are  then  captured  in  the  long  counting  time  T  since  7'»tp. 

The  comparison  is  effected  by  matching  the  NTA  mean  and  variance  (2)  to  the 
mean  and  variance  of  the  SNDP  counting  distribution,  which  is  given  by 


Yar  (n)  =  |  I  4-  ^  j<  h>. 


The  quantity  .  //  is  a  degrees-of-treedom  parameter  that  is  determined  by  h(t)  and  V\ 
When  //(/)  has  the  rectangular  form 


//(/)  = 


di  gt  ces-of -freedom  paiailietci  //(SNDP,  rci  t  )  heroines 


.//(SNDP.  reel )  =  • 


(  (/<- J/l2)  ',  t/K  I). 


N*-\V 


(/<£!). 


I  lie  usefulness  of  the  NTA  as  an  approximation  to  the  SNDP  counting 
distribution  with  spectral  properties  determined  In  the  rectangular  form  of  hit)  is 
illustrated  graphically  m  figure  1  Six  representative  SNDP  counting  distributions 
/i(  ii  I  ai  e  show  ii  versus  count  number  ii  for  \  a  nous  values  ol  the  S  N  I )  P  mult  i  pi  icat  ion 
parameter  a  and  the  capture  ratio  /f  =  7  r(.  These  distributions  were  calculated 
from  the  lei  ursion  relations  derived  earlier  [  I  J.  The  mean  count  (n)  is  I  (I  in  all  cases. 
Also  show  n  in  figure  I  are  six  NTA  dist  rihut  ions  \\  hose  mean  and  variance  ha  \  c  been 
matched  to  those  of  the  SN  DP  counting  distributions  b>  equating  ( 2 )  and  (4  ),  and  by 
using  (6)  to  solve  for  a  in  terms  of  a  and  /f.  I  n  several  panels  of  figure  I  .  the  tw  o  curves 
are  so  ncarh  identical  that  thc\  look  lift-  a  single  curve. 

In  figures  I  ( a )  (el,  a  is  chosen  to  he  unit  \  .  whrii.is  hi  Iil'iih  -  I  (i/l  (  /  i ,  a  Id. 
/f  —  1 II  Is  ,ii  the  (op  panels,  mci  easing  In  5  ( l  a  I  the  In  it  loin  pain  K  1 1  is  .ipp.u  cut  Irom 
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■'inure  2.  Sis  representing e  exponential  SNDl’  counting  distributions  compared  with 


NTA  distrihutions  of  the  same  mean  (<//)  =  10)  and  variance.  The  parameters  2  and  /I 
are  indicated. 


in  which  case  the  exponential  degrees-of-i  reedom  parameter  ./7(SNDI\  exp)  is 
given  by  1 1  ] 


.//(SNl)l’,  exp)  = 


2 l< 


exp  ( —  2[l)  +  2  [i—  1 


(8) 


Again,  various  values  of  2  and  /(  are  represented  in  the  individual  panels  of  figure  2, 
and  <m>=  10.  The  matching  of  the  variance  is  again  acc  nplished  by  equating  (2) 
and  (4),  but  now  (8)  is  used  to  give  a  in  terms  of  2  and  fl. 

In  figures  2  (u)  (r),  2=1,  whereas  in  figures  2(d)  (/),  2=10.  /(  increases  from 
0  0?  at  the  top  panels  to  .?  0  at  the  bottom  panels.  Apparently  the  NTA  and 
exponential  S\|)|’  dixl  i  ibut  ions  are  also  indistinguishable  (within  the  graphical 
accuracv  ol  the  figures)  when  2=1.  independent  of  the  value  of  />'.  In  this  ease, 
however,  their  is  a  bit  mine  ihsp.it  it  v  bet  w  ceil  the  NTA  and  the  exponent  la  I  SNDl’ 
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distribution  \\  lien  a.  =  10  ami  //  =  005  (figure  2  (</));  tins  is  expected  [  1  ].  In  I  act,  the 
NTA  misses  the  mark  by  a  bit  for  all  cases  in  which  a  =10.  Nevertheless,  a 
comparison  ol  figures  1  and  2  demonstrates  that  the  NTA  fares  just  about  as  well  in 
approximating  rectangular  and  exponential  SNDI’  counting  distributions.  The 
SNDP  distributions  appear  to  be  smoothed  versions  of  the  NTA,  in  which  the  sharp 
clips  of  p(n),  at  low  values  of  are  washed  out.  T  he  tails  of  the  distributions  are 
approximately  the  same. 

The  goodness  of  the  approximation  can  be  quantitatively  estimated.  To 
accomplish  this,  we  calculate  the  relative  error  S(n)  as  a  f  unction  of  the  count  number 
n,  where  S(n)  (in  %)  is  defined  as 

r  \p(n,  SN'DP)  — />(«,  N'TA)| 

(Km)=  00x-  ■—  F  .  (9) 

V  pin,  SNDI') 

d(n)  is  displayed  in  figures  3(n,b)  for  rectangular  SNDI’,  and  in  3(r,</)  for 
exponential  SN  DP,  counting  distributions.  The  values  of  a  and  / 1  are  indicated  in  the 
individual  panels  of  the  figure,  and  again  <Ti)  =  1  0.  The  error  is  oscillatory  \\  ilh  the 
count  number,  indicating  that  the  approximation  is  better  for  certain  values  of  n. 
Overall,  the  relative  error  is  often  quite  small,  but  it  can  become  large  for  particular 
\  allies  of  n.  IN  pec  tat  ions  based  on  then  ret  ical  arguments  are  borne  out .  The  NTA 
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provides  a  slightly  better  approximation  for  the  rectangular  SN DI*  distribution  (i.e. 
the  curves  lie  a  bit  lower)  than  for  the  exponential  SNDP  distribution.  The 
approximation  is  always  somewhat  better  for  /f  =  0  05  than  for  higher  values  of  /l. 

4.  Comparison  of  the  NTA  and  Thomas-process  counting  distributions 

The  NTA  also  approaches  the  TIM*  counting  distribution  in  certain  limits.  The 
TPP  counting  distribution  with  arbitrary  )i(t)  becomes  the  Poisson  distribution 
when  a  =  0,  or  when  (I  and  /ia  become  vanishingly  small  [3J.  The  first  conditions 
represents  a  degenerate  case  in  which  there  are  no  secondary  events  but  Poisson 
primary  events  are  carried  forward,  whereas  the  second  condition  represents  a 
situation  in  which  the  secondary-pulse  clusters  are  cut  apart  by  the  small  value  of  T 
(/}«  1).  Since  the  NTA  also  approaches  the  Poisson  when  a— >0,  it  clearly  provides  a 
good  approximation  to  the  TPP  counting  distribution  in  this  limit.  In  the  opposite 
case,  when  jl»  1 ,  all  of  the  secondary  events  are  captured  in  the  long  counting  time  T, 
along  with  the  primary  events,  and  the  TPP  counting  distribution  with  arbitrary  h(t) 
approaches  the  Thomas  distribution  [3,  4,  1  6],  which  in  turn  is  similar  to  the  NTA . 
Finally,  in  the  limit  // »  1 ,  a  »  1 ,  the  TPP,  SNDP  and  NTA  counting  distributions  all 
approach  the  fixed  multiplicative  Poisson  distribution  [ 4 J . 

The  variance  of  the  TPP  counting  distribution  is  [ 3 J 


v  ,,  1+3 a/./Z'  +  a2/.//,  , 

\  ar  (ti)  = -  - - <»»>, 

1  +  a 


(10) 


where  . //'  and  . //  are  Thomas  degrees-of-freedom  parameters  that  are  determined 
by  h(l)  and  T.  When  h(t)  assumes  the  rectangular  form  represented  in  (5), 


.//(TPP,  rect)  =  .//(SNDP,  rcct), 
as  given  by  (6)  and  [ 3 J 


.//  (TPP,  reel)  =  j  3(1  +  ^ 

U/M) 


-  l 


(//>!)• 


(11) 


(12) 


'The  NTA  and  TPP  counting  distributions  were  compared  in  the  same  way  as 
reported  earlier.  The  means  and  variances  of  the  NTA  and  the  TPP  distributions 
were  matched  by  equating  (2)  and  (10),  and  by  using  (6),  (11)  and  (12)  to  provide  an 
expression  for  a  in  terms  of  a  and  //. 

Six  representative  rectangular  TPP  counting  distributions,  along  with  six  NTA 
distributions  of  the  same  mean  and  variance,  are  displayed  in  figure  4.  The  TPP 
distributions  were  computed  from  the  recursion  relations  obtained  previously  [3]. 
The  mean  <(//>  was  again  chosen  to  be  10,  and  the  same  values  of  a  and  /(  were  used  as 
in  figure  1  for  the  SNDP.  I n  four  panels  of  figure  4  the  NTA  and  TPP  curves  are  so 
nearly  identical  that  they  look  like  a  single  curve.  Comparison  of  figures  1  and  4 
reveals  a  great  deal  of  similarity. 

Again,  the  goodness  of  the  approximation  may  be  quantitatively  established  by 
use  of  the  relative  error  6(n)  expressed  in  (9),  substituting  />(>/,  TPP)  for  pin,  SNDP). 
'The  result  is  displayed  in  figure  5.  The  values  of  yn),  a  and  /(  are  the  same  as  those 
used  for  the  SNDP  in  figures  3  («,/>).  Again,  the  NTA  approximation  is  somewhat 
better  for  //  =  005  than  for  higher  values  of  (S.  A  general  comparison  of  figures  5  and  3 
quickly  confirms  that  the  NTA  provides  as  suitable  an  approximation  for  the 
rectangular  TPP  distribution  as  it  docs  for  the  rectangular  SNDP  distribution. 
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I  iuiiic  4  Sin  i  cpresentat  n  e  ret  tangular  ’  I  'I  *1*  counting  distributions  compared  with  NTA 
ilisinhui unis  ul  tin-  same  mean  (y/i[>  —  10)  and  variance.  The  parameters  z  and  / i  are 
indn  .lied 


We  conclude  that  the  NTA  provides  an  excellent  approximation  for  the  SNDP 
and  TIM’  counting  distributions,  with  rectangular  and  exponential  impulse  response 
functions,  when  the  mean  and  variance  of  the  distributions  are  matched.  The  NTA 
distribution  therefore  plays  the  same  role  for  shot-noise  light  that  the  negative- 
binomial  distribution  plays  for  chaotic  light. 

Finally,  we  note  that  counting  distributions  for  cascaded  Poisson  processes  with 
an  arbitrary  number  of  stages  have  recently  been  calculated  [23,  24],  As  an  extension 
of  the  work  reported  here,  it  would  be  of  interest  to  examine  the  performance  of  the 
NTA  distribution  as  an  approximation  to  the  cascaded-Poisson  family  of  counting 
distributions.  A  similar  approach  may  also  be  useful  for  approximating  non-Poisson 
cascaded  processes,  such  as  those  used  for  describing  the  generation  of  antibunched 
and  sub-Poisson  light  ( 2 5 j .  Aside  from  the  NTA,  many  two-stage  simple  time- 
independent  counting  distnh  itions  have  been  developed  over  the  years. 


Pliulnt  Dunlin^  si  ill  i  si  us  nf  shot -muse  Ill’ll I 


I  177 


Number  cl  counts  n 
(/>) 

I  li!inr  5.  Rclal  t\  c  cmiii  i)(ii)  (|ht  lint)  versus  mimhcr  of  counts  li  Ini  nil.  inuiil.it  TPP 
immling  d  ist  1 1 1  >  i  ■  t  ii  ills  riles,-  emus  .ill-  nhlanicd  Irnni  tile  distributions  in  ligutc  -I. 
'ii  -lit.  y  .mil  />’  .in  ii nln  ;iieil 


Acknowledgment 

We  are  grateful  to  Sein  Lee  and  James  Ritcey  for  computational  assistance.  This 
work  was  supported  by  the  I’.S.  Joint  Services  Klectronics  Program  and  by  the  I  S. 
National  Science  Foundation. 


References 

(I  |  Sai  I  II,  Ii  K.  A.,  and  Ti  li'it,  M  1482,  Prof.  Ins!,  elcit.  t'liilnm  pnqi s,  70,  224-245. 
f 2 1  Ti  n  it.  M.  C.,  and  Sai.iii,  li.  17  A  .  1081.  Pins  R,v  A.  24,  1051  1054. 

[ 3 1  Maisio,  K.,  Trim,  M.  C7.  and  Sai. tat,  H.  17  A  ,  1 48. i,  ,•//>/>/.  Opt..  22,  1848  1404. 

[4 1  Tm  ii,  M.  C  ,  1481,  Appl.  Opt..  20,  2457  24(>7 

1 5 1  Sai.i-h,  li.  17  A.,  Tavoiacci,  J .  T  ,  ami  linn,  M.  t_7,  1481,  I  PEP.  Jt  quant  P.lvil  ran..  17, 
2.141  2150. 

|(i|  Sai.hi l ,  It,  17  A.,  and  Trim,  M.  (.7,  1481,  1P.PP.  Trans.  Inf.  Theory,  29,  414-441 
[  7 1  Sai.iii,  Ii.  17  A  ,  Sun. in,  I).,  ami  Ti  n  ii,  M.  (_\,  1481,  Phys.  Rev.  A,  27,  300  174 
( 8 1  llia.si  hum  ,  \\7,  and  Kiel,  S  ().,  1484,  J.  annpiil  Pin  s  ,  54,  284  124. 

1 4 1  linn,  M  (7,  anil  S \i  i  ii.  It  I .  A  ,  1481,  J .  opt.  Sm  Am  ,  7 1 ,  77 1  77(i 

( 10]  Ti  icii,  M.  C.,  Pin  i  n  m.,  IV  U  ,  Yanni  «  <  i ,  (7,  lim  ion,  M.  17,  and  Mitiu  i  ,  \Y.  j  ,  1482, 
J.  apt.  Sac.  Ain.,  72,  414  411 

1 1  1  |  Sai.iii,  li  1 .  A.,  ami  linn,  M  1 7 ,  1485,  Rml  ( '\l>c> ni  l  ,  52,  101  107 
[  I  2  I  \  IVM  IN',  |  ,  and  Si  o  I  I  ,  I  I  ,  .  1458.  .11  R  slain  I  Sm  li.  20,  1  4  1 

|  I  1 1  No  M  \  N,  J  ,  am  I  Pi  III,  P  S  .  I  47  Pi  III  null  .  I,  a, I  S,  i  /  ,S  .  I  .  7,1,  lltill  1  in  i 

[  1 4 1  I.awkiniI,  A  |  ,  1472,  ,8  f  hi  lunt  ii  Point  Piuiisms  Slalntiml  Analysis,  limns,  and 
.■Ippln  alums,  cd 1 1 ci I  1>\  PAW  I , mi  is  (  New  N Hi  k :  \\  ile\  - 1  n terse inn  i  ),  pp  144  2  5t» 

|  1  5 1  Nl-VMAN,  J  ,  1414,  .Inn.  math  slalnl  .  10,  15  57. 
f  1  f>  |  Tiiomxs,  M  ,  1444,  Hmnii  ii  lim,  16,  IS  2s 


I  17S 


Pholocoiinling  statistics  of  shot -noise  light 


f  1  7 1  Beharh,  C.,  Chang,  J  C.,  and  Manuel,  L  ,  1967,  Pins.  Res..,  160,  1496  1500. 

[ISJ  l*i  kin  a  ,  J  .  1967,  Pins  Lett..  24A,  333  334. 

[  19]  Pekina,  J.,  Pekinova,  V.,  Lag  ms,  G.,  and  Bhaunehova,  Z.,  1973,  Czech.  J .  Pliys.  B,  23, 
1008-1013. 

[20]  Perina,  J.,  1984,  Quantum  Statistics  of  Linear  and  Nonlinear  Optical  Phenomena 
(Dordrecht:  Reidel),  Chap.  5,  p.  148. 

[211  Hkadil,  Z.,  and  Pekina,  J.,  1986,  J.  opt.  Sue.  Am.  A,  3,  1328-1333. 

[22]  Sai.f.ii,  B.  E.  A.,  1978,  Photoelectron  Statistics  (Berlin:  Springer). 

[23]  Matsuo,  K.,  Saleh,  B.  L.  A.,  and  Teich,  M.  C.,  1982,  J.  math.  Pliys.,  23,  2353-2364. 

[24]  Matsuo,  K.,  Teich,  M.  C.,  and  Saleh,  B.  E.  A.,  1984,  J.  math.  Pliys.,  25,  2174-2185 

[25]  Teich,  M.  C„  Saleh,  B.  E.  A.,  and  Pekina,  J.,  1984,  J.  opt.  Soc.  Am.  B,  1,  366  389. 


36 


STATISTICAL  PROPERTIES  OF  COSMIC-RAY  SHOWERS  AT 


2663 


ACKNOWLEDGMENTS 

This  work  was  supported  by  the  National  Science 
Foundation  and  by  the  Joint  Services  Electronics  Pro¬ 
gram  at  Columbia  University.  We  are  grateful  to 
members  of  the  NASA  Goddard  Institute  for  Space 
Studies  for  providing  access  to  computer  and  plotting  fa¬ 
cilities,  and  for  assistance  in  locating  various  NASA  do¬ 
cuments.  William  Mindel  is  also  acknowledged  for 
computer-related  support.  We  appreciate  greatly  the 
participation  of  Sridhar  Anandakrishnan,  Ned  Peirce, 
and  Todd  Larchuk  in  various  phases  of  the  experiments. 


APPENDIX  A:  RELATIONSHIP 
BETWEEN  YULE-FURRY 
AND  LOGARITHMIC  COUNTING  STATISTICS 


The  statistical  behavior  of  logarithmic  counting  statis¬ 
tics  was  shown  in  Sec.  II  A  to  resemble  that  of  the  pure- 
birth  (Yule-Furry i  model  The  purpose  of  this  appendix 
is  to  present  a  connection  between  the  two  counting  dis¬ 
tributions 

For  a  general  discrete  probability  distribution  P(x), 
we  consider  the  expectation 


<x  > 


p\x  i= : 


(All 


which  is  clearly  normalized  This  leads  us  to  view  the 
summ.d  expression  abose  as  a  probability  transforma¬ 
tion  of  the  type 

PB<x  >  =  [x  /(x  A  >)P4(x)  ,  (A2) 

from  which  the  auxiliary  condition 

(x,,)  =  F4  +  (xa)  (A3) 

may  be  obtained,  where  FA  is  the  Fano  factor,  or 
sariance-to-mean  ratio,  of  the  original  distribution. 

It  is  evident  from  the  construction  of  Eq.  (A2)  that 
Pfl(x )  is  zero  when  x  =0,  although  this  need  not  be  true 
for  PA(x).  The  former  distribution  may  thus  belong  to 
the  class  of  unit-shifted  counting  distributions,  whose 
members  can  represent  cascades  in  which  the  primary- 
particle  is  included  along  with  its  offspring  (as  discussed 
in  Sec.  II  A). 

If  P^lx)  is  the  Poisson  distribution,  we  employ  Eqs. 
(1)  and  (3)  to  evaluate  Eqs.  (A2)  and  (A3): 

PB(x)=  (xA  )*  'exp(  -  (x  A  )  )/(x  —  1 )!,  x  >  1  , 


(A4a) 

(xg  )  =  1  +  (xA  )  .  (A4b) 

This  is  just  a  unit-shifted  version  of  the  Poisson  input, 
which  can  be  easily  verified  by  comparison  with  Eq.  (5i. 

On  the  other  hand,  if  PA(x)  is  the  logarithmic  distri¬ 
bution,  then  with  the  aid  of  many  of  Eqs.  i7)-(13)  we  in 
fact  obtain  the  Yule-Furry,  or  shifted  Bose-Einstem, 
statistic 


P„(*)  =  exp( -P )[  I  — exp(  -/?)]'  ',  x  >  1  , 


(xB  )  =  1  +d(xA  ),  or  6  =  P  ( A 5b i 

The  simplicity  of  the  transformation  that  links  the 
Yule-Furry  and  logarithmic  distributions  is  reflected  by 
their  shared  mathematical  properties. 

We  may  recast  Eq.  (A2)  into  equivalent  relations  in 
the  domain  of  generating  functions.  The  most  compact 
result  is  obtained  via  the  probability-generating  function 
(PGF),  which  is  defined  as 

G(z)e(z,)=2z,F(x).  <A6) 

x  =0 

Applying  Eq.  (A6)  to  Eq.  (A2)  and  using 

Gg\z)=zGB(z)  ,  (A7) 

which  relates  the  PGF’s  of  the  shifted  and  unshifted 
counting  distributions,  respectively,  we  obtain 

Gg<z>=—! l-Jj-GA(z)  ,  (A8a) 

' x  a  >  dz 

and  its  inverse 

Ga(z)=  (xa  >  [*Gg(z)dz  .  (A8b) 

J  o 


From  Eq.  (A8b)  we  can  see  that  the  logarithmic  PGF 
behaves  like  an  unbounded  cumulative  version  of  the 
(unshifted)  Bose-Einstein  PGF.  The  moments  of  these 
two  distributions  can  be  linked  to  all  orders  by  evaluat¬ 
ing  Eq.  (A8)  at  z  =  exp(  —  j)  to  obtain  relations  for  the 
moment-generating  function  Qis),  whose  properties  are 
given  by  Eqs.  (Bl)  and  (B2)  of  Appendix  B. 


APPENDIX  B:  THIRD-ORDER  MOMENTS 
FOR  A  CASCADED  PROCESS 

It  is  well  known  that  the  nth  ordinary  moment  of  a 
general  counting  distribution  P(x  I  may  be  obtained  from 
its  moment-generating  function  (MGF)  (Ref  1-U, 

Qxls)=  <exp(  —  sx  )  > 

or 

=  2  e*P<  -s*1^**'  •  (Bl) 

*  =0 

by  successive  differentiation  according  to 

1 

<x">  =  (-l)"  —  Q,\s)‘  for  all  n  .  (B2) 

ds"  1,-0 

Alternatively,  it  is  oftentimes  more  convenient  to  em¬ 
ploy  the  semi-invariant  or  cumulant-generating  function 
(CGF) 

A.tslslnt^jlr)]  ,  (B3) 

which  yields  the  mean  (x),  variance  I'(x),  and  third 
central  moment  M)(x)  directly 

a  1 
(x  )=  -  -q-AjIsl  1 


<A5a) 


(B4a> 
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gradual  decrease  in  probability  between  adjacent  counts 
beyond  n  =  1  in  all  our  experimental  distributions;  it  is 
typically  shallower  than  that  required  by  the  above  esti¬ 
mate.  We  conclude  that  the  afterpulsing  contribution  is 
not  crucial  and  need  not  be  considered  in  our  models 

Luminescence  can  arise  from  the  excitation  and  relax¬ 
ation  of  molecular  states  in  the  faceplate  material  of  the 
PMT,  and  has  been  studied  extensively  in  connection 
with  the  effects  of  the  radiation  environment  on  space- 
borne  photomultiplier  tubes.34-37  Unlike  Cherenkov 
photons,  which  are  copiously  produced  but  cannot  be  in¬ 
dividually  resolved  at  the  detector,  the  less  numerous 
luminescence  photons  are  emitted  during  the  compara¬ 
tively  long  molecular  relaxation  times  and  may  thus  be 
registered  as  individual  events.  In  fluorescence  decay, 
this  deexcitation  commonly  occurs  from  over  a  few  to 
hundreds  of  microseconds,  whereas  in  phosphorescence 
decay  times  may  span  from  minutes  to  days.  Of  the  two 
processes,  the  former  is  likely  to  be  most  significant  to 
our  counting  experiments  because  its  decay  parameter 
can  be  well  accommodated  within  the  chosen  counting 
times. 

A  statistical  estimate  of  fluorescence  noise  is  very 
difficult  to  obtain,  unless  the  correlation  between  cosmic 
rays  and  fluorescence  photons  at  the  detector  is  known. 
This  relationship  cannot  be  deduced  following  a 
photon-counting  experiment  because  knowledge  of  the 
occurrence  times  and  pulse  heights  of  individual  events 
is  lost.  For  this  reason  we  cannot  totally  exclude 
fluorescence  from  our  experimental  measurements,  al¬ 
though  arguments  against  its  dominance  can  be  set 
forth. 

If  cosmic-ray  particles  were  completely  divorced  from 
fluorescence  emissions,  the  latter  would  simply  add  to 
the  uniform  background  of  random  events,  which  is  nat¬ 
urally  incorporated  into  our  models  via  the  dark-rate  pa¬ 
rameter  nn.  A  strong  correlation  between  the  two  phe¬ 
nomena,  on  the  other  hand,  would  imply  that  a  cosmic- 
ray  count  is  closely  followed  by  fluorescence  events,  thus 
falsely  enhancing  the  shower  clustering  properties. 

However,  recent  numerical  simulations  by  Howell  and 
Kennel  of  NASA  (Ref.  35 1  have  shown  that  existing 
correlations  between  Cherenkov  and  fluorescence  pho¬ 
tons  in  the  PMT  window  are  substantially  reduced  en 
route  to  detection  at  the  photocathode  According  to 
their  simulated  bivariate  distribution,  the  latter  may 
even  at  times  appear  without  the  presence  of  the  former 
“This,”  the  authors  suggest,  "may  be  largely  attributed 
to  the  directed  nature  of  the  Cherenkov  radiation,  in 
which  photons  exit  the  window  with  little  or  no  internal 
reflection,  but  the  fluorescence  photons,  in  their  more 
random  flights,  eventually  strike  the  photocathode  either 
directly  or  after  reflection." 

Some  evidence  in  favor  of  this  view  may  be  gathered 
from  our  data  For  instance,  the  experimental  interevent 
time  distributions  shown  in  Figs  14(b)  and  15  reach  lo¬ 
cal  minima  precisely  within  the  temporal  region  associ¬ 
ated  with  fluorescence  photon  arrivals  (1-100  jzsec) 
This  is  particularly  true  of  the  Hamamatsu  tube,  perhaps 
because  its  solar-blind  response  reduces  fluorescence  in 
the  visible  regime.  Strong  correlations  would  most  cer¬ 


tainly  yield  an  enhancement  in  this  emission-time  win¬ 
dow 

The  small  excess  clustering  that  exists  in  the  Apollo 
17  counting  distributions  from  space  with  very  long 
counting  times  (see  Fig.  2)  also  supports  the  above  inter 
pretation,  especially  since  the  MgF2  window  of  the  asso¬ 
ciated  PMT  fluoresces  more  efficiently  than  the  SiO,  and 
glass  materials  used  by  the  other  photomultipliers  ' 

We  believe  fluorescence  is  not  sufficient  to  fully  ac¬ 
count  for  the  observed  geometric  feature  that  is  so  well 
described  by  our  models  (see  Fig  13).  Nevertheless,  fu 
ture  experiments  employing  point-process  detectors  in 
conjunction  with  pulse-height  analyzers  would  more  pre¬ 
cisely  elucidate  the  contribution  of  fluorescence  noise  to 
the  clustered  photomultiplier  background 


IV.  CONCLUSION  AND  DISCUSSION 


We  have  shown  that  cosmic-ray-shower  fluctuations  at 
ground  level  can  be  effectively  represented  by  the 
Poisson-driven  Yule-Furry  or  the  negative-binomial 
(Poisson-driven  logarithmic)  two-parameter  counting 
probability  distributions.  The  agreement  between  data 
and  theory  determined  with  photomultiplier  tubes 
operated  in  the  dark  is  remarkable,  given  the  simplicity 
of  the  underlying  pure-birth  cascade  model  The 
mathematical  basis  for  the  success  of  the  theory  lies  in 
the  presence  of  mean-square  excess  fluctuations  in  the 
variance  of  the  multiplication  statistics,  of  which  the 
Yule-Furry  and  the  logarithmic  distributions  are  funda¬ 
mental  examples.  In  order  to  extract  the  cosmic-ray- 
shower  component  from  the  PMT  data,  we  found  it 
necessary  to  formulate  a  more  precise  description  of 
photomultiplier  background  events  by  combining  the 
Poisson-driven  cascades  with  an  independent  Poisson 
random  variable  that  summarizes  other  count  sources  at 
the  detector. 

Finally,  we  note  that  cosmic-ray  clusters  are  particu¬ 
larly  deleterious  to  some  counting  experiments  We 
have  shown  that  cosmic-ray  clusters  can,  in  fact,  be 
avoided  in  photon-counting  experiments  by  restricting 
the  counting  duration  to  less  than  tens  of  seconds.  The 
background  events  can  then  be  described  by  a  simple 
Poisson  counting  distribution  with  a  very  low  mean 
(which  is  just  a  Bernoulli  distribution  when  only  0  or  1 
counts  are  observed).  This  is  particularly  useful  for  ap¬ 
plications  that  seek  to  determine  counting  statistics 
which  are  sensitive  to  excess  clustering.  One  important 
example  is  the  detection  of  stationary  photon-number 
squeezed  light,  as  recently  reported  by  a  number  of  au¬ 
thors,33,38-40  since  the  superposition  of  super-Poisson 
cosmic-ray  counts  at  the  phototube  may  destroy  the 
nonclassical  observable.  In  general,  if  photomultiplier 
tubes  are  used  in  photon-counting  experiments  that  can¬ 
not  be  conducted  at  short  durations,  cosmic-ray  clusters 
may  pose  a  significant  limitation.  The  background  mod¬ 
els  we  have  presented  will  then  provide  a  useful  analyti¬ 
cal  point  of  departure. 
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Hamamatsu  type  R431-S  PMT,  as  shown  in  Fig  15 

A  sharp  peak  appears  in  Fig  1 4Cb •  at  n  0.5  psec;  this 
same  phenomenon  occurs,  and  is  especially  pronounced, 
in  Fig  15  at  r  =  0  12  psec  This  feature  is  attributed  to 
brief  but  intense  bursts  of  evenly  spaced  cosmic-ray 
events  of  unknown  origin,  which  drive  the  Fano  factor 
in  counting  experiments  well  above  unity  These  peaks 
permit  us  to  estimate  the  typical  intracluster  event  spac¬ 
ing  Since  the  smallest  counting  periods  in  our  counting 
experiments  (7”  =  10  pseci  are  well  in  excess  of  the  intra¬ 
cluster  separation,  it  is  clear  that  many  of  these  events 
are  indeed  captured 

The  broader  tails  of  the  two  distributions  (at  the  larg¬ 
est  values  of  n  are  exponential  and  reflect  intercluster 
separations  arising  from  the  cascade's  primary  (Poisson) 
process  Additional  counts  from  noncascading  sources 
at  the  PMT  are  also  present,  but  these  do  not  alter  the 
exponential  character  of  this  tail  since  they  are  also  ran¬ 
dom  events  The  quenching  of  both  distributions  below 
t=  100  nsec  arises  from  the  limited  electronic  resolution 
of  the  detectors  Typical  pulse  durations  'full-width  half 


maximum)  for  the  Hamamatsu  system  are  rated  at  about 
25  nsec. 


E.  Other  sources  of  dark  events  in  the  photomultiplier  tube 

The  phenomena  of  afterpulsing  and  luminescence, 
which  are  known  to  be  important  sources  of  spurious 
counts  in  certain  photomultiplier  tubes,  have  not  been 
incorporated  into  our  models.  Since  these  are  primarily 
tube-dependent  effects,  the  introduction  of  further 
branching  steps  to  the  mathematical  formulation  may 
not  always  be  necessary. 

Afterpulsing  is  known  to  arise  in  photomultipliers 
from  ion  bombardment  of  the  cathode  and  initial  dy- 
nodes.24  29  These  charged  agents  result  from  the  ioniza¬ 
tion  of  residual  and  leakage  gases  (such  as  hydrogen, 
helium,  and  nitrogen)  during  electromultiplication  The 
process  is  governed  by  the  ion  transit  time,  which  is 
broadly  distributed  (even  if  afterpulsing  from  the  dy- 
nodes  is  eliminated  by  proper  discrimination1  since  it  de¬ 
pends  on  the  mass  of  the  ionized  species  and  its  path 
length  back  to  the  photocathode 

A  simple  estimate  of  the  afterpulsing  probability 
Pa<T)  per  photocathode  event  can  be  obtained  by  con¬ 
sidering  a  cascade  of  Poisson  primaries  with  a  Bernoulli 
law  for  the  creation  of  secondary  afterpulses  Using  a 
laser  source  of  Poisson  photons  do  render  the  dark-rate 
contribution  negligible)  and  conducting  photon-counting 
experiments  a|  T  =  1  psec  and  D  =0.4  sec  Ho  enahle 
cosmic-ray  effects  to  be  excluded  from  most  of  the  ex¬ 
periments',  Teich  and  Saleh''  estimated  that  P0'  1 
/zsec i  =  1  6*  10  '  for  the  Hamamatsu  tube  For  larger 
counting  times,  an  enhancement  of  P_'T‘  is  expected, 
since  more  afterpulses  are  likely  to  be  captured  by  the 
increased  time  window 

Even  so,  this  is  not  sufficient  to  account  for  the  verv 


FIG  14  la1  Interevent  probability  density  P1  t)  versus  in- 
terevenl  lime  r  isec'  for  background  counts  in  a  RCA  type 
8575  PMT,  on  semilogarithmic  coordinates  The  approximate¬ 
ly  exponential  dependence  for  r>0.0l  sec  reflects  Poisson- 
distributed  arrival  times  The  data  set  was  collected  on 
7/19/82.  at  an  18  59  75  start  time,  and  contain  881002  sam¬ 
ples  f b i  The  same  data  presented  in  a  double-logarithmic  for¬ 
mal  The  peak  at  r  =  0  5  psec  reflects  cosmic-ray  intercluster 
event  spaungs  Even  the  smallest  counting  period  used  in  our 
counting  experiments  <7“  =  10  psec>  is  sufficiently  large  to  cap¬ 
ture  these  clustered  events 


FIG  15.  Interevent  probability  density  Pin  versus  in- 
lerevent  time  r  (sec1  for  background  counts  in  a  Hamamatsu 
type  R431-S  PMT.  on  double-loganthmic  coordinates  The 
distribution  resembles  that  of  Fig  1 4tb 1  The  pronounced 
feature  arising  from  clustered  cosmic-rav  effects  is  evident  »t 
r  =  0 .13  psec  The  data  set  was  collected  on  9/7/83  and  con¬ 
tains  500000  samples. 
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FIG.  11.  (a)  Primary  cosmic-ray  driving  rate  ^ r  isec  1 ),  ib' 
mean  multiplication  (m  ).  and  (ci  dark  (noncascadingi  driving 
rate  fi‘D  (sec  versus  counting  period  7  These  parameters 
were  extracted  from  the  sequence  of  experiments  shown  in  Fig 
9  via  the  NB°P  model  background  events  They  do  not  vary 
substantially  over  the  range  of  counting  times  displayed  A  to¬ 
tal  of  six  counting  distributions  out  of  50  were  noi  able  to  be  fit 
by  this  model. 

identification  of  the  two  components  of  each  model  . 
For  low  count  numbers,  the  shape  of  the  distribution  is 
inherited  primarily  from  the  Poisson  law  of  the  noncas¬ 
cading  events  i(nD  )  =/aD  fa  1.2,  whereas  (np)=tipT 
=  0.054).  On  the  other  hand,  the  quasigeometric  tail 
beyond  n  =5  is  characteristic  of  the  appropriate  PDYF 
and  negative-binomial  distributions;  it  provides  a  telltale 
signature  of  cosmic-ray  clusters  at  the  detector.  This 
dual  behavior  is  quite  adequately  captured  by  the  models 
we  have  considered. 

We  also  remark  on  the  manner  of  decomposition  of 
the  total  background  rale  in  Table  I  it  is  shown  that 


FIG.  12.  Counting  distribution  P(n;7l  versus  number  of 
events  n  for  the  Schlumberger  EMR  type  54IN-06-14  PMT 
designed  for  the  NASA/JPL  Galileo  mission  (9/16/77,  Base 
line  B-l  experiment!  operated  in  the  dark  with  counting  period 
T  =  40  /rsec  and  duration  £>  =  19  min  Open  circles  represent 
experimental  values  Solid  and  dashed  curves  represent  the 
PDYF«P  and  NB«P  theoretical  models,  respectively  The  ex¬ 
perimental  and  theoretical  parameters  are  displayed  in  Table  I 
This  distribution  resembles  that  for  (he  Hamamatsu  PMT 
displayed  in  Fig  7  except  that  larger  clusters  are  observed 


FIG  1 .1  Counting  distribution  /  n  T  versus  number  of 
events  n  for  the  RCA  type  *575  PMT  (7/8/82.  17  014b 
operated  in  the  dark  with  counting  time  7  =  10  msec  and  dura 
lion  D  =  3  d  Open  circles  represent  experimental  values  The 
solid  and  dashed  curves  represent  the  PDYF°P  and  NB°P 
theoretical  models,  respectively  All  experimental  and  theoret 
ical  parameters  are  displayed  in  Table  I  The  underlying  Pois¬ 
son  distribution  for  noncascading  sources  is  clearly  evidenl  for 
low  count  numbers  In  <  5 1,  whereas  the  characteristic  tail  from 
the  Poisson-driven  cascade  counting  distributions  is  observed 
for  large  count  numbers  (n  >5)  Both  models  capture  this 
dual  behavior  with  remarkable  precision 


the  dark  rate  for  this  counting  distribution  received  the 
largest  share  in  such  a  way  as  to  maintain  the  primary 
rate  estimate  within  an  order  of  magnitude  of  those  for 
the  Hamamatsu  tube  experiments.  This  is  not  unreason¬ 
able  since  it  is  likely  that  noncascading  sources  (such  as 
thermionic  emission)  and  discriminator  adjustments  will 
vary  from  one  experimental  setup  to  another. 

D.  Interevent-time  probability  density  functions 

The  experimental  counting  distribution  P<n;T)  is  an 
important  statistical  measure  of  a  point  process.  The 
process  of  counting,  however,  forsakes  a  knowledge  of 
the  occurrence  times  of  the  individual  events  and,  there¬ 
fore,  provides  only  a  partial  characterization  of  the  en¬ 
tire  point  process.  Another  window  on  this  information 
is  the  probability  distribution  Pir)  of  the  interevent 
times,  which  is  constructed  experimentally  by  recording 
a  histogram  for  the  temporal  separations  of  adjacent 
pulses  that  are  properly  discriminated  (see  the  discussion 
on  pulse  discrimination  in  Sec.  Ill  A).  For  the  homo¬ 
geneous  Poisson  point  process,  this  distribution  follows 
the  simple  exponential  law  P(r)  =  /x  exp( -^iT),  where  ji 
is  the  constant  driving  rate  of  the  process  * 

In  Fig.  14(a)  we  present  P(r)  for  the  RCA  type  8575 
PMT  operated  in  the  dark.  On  these  semiloganthmic 
coordinates,  the  experimental  distribution  may  indeed  be 
well  approximated  by  a  linear  trend  for  large  interevent 
times,  but  this  Poisson  description  fails  for  r<0.01  sec 
The  deviation  can  be  scrutinized  by  a  double-logarithmic 
presentation  of  this  same  data,  as  shown  in  Fig.  14(b)  A 
similar  distribution  has  been  obtained  with  the 
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FIG  8  'a  Counting  distribution  P  n.T  versus  number  of 
counts  n  for  the  Hamamatsu  type  R431-S  PMT  '11/14/83, 
05  16  15-  operated  in  the  dark  with  counting  time  T  =44  ^sec 
and  duration  D  -  1000  sec.  The  open  circles  represent  experi¬ 
mental  values,  whereas  the  solid  and  dashed  curves  represent 
the  PDYF°P  and  NB°P  theoretical  models,  respectively  All 
experimental  and  theoretical  parameters  are  displayed  in  Table 
I  Note  the  presence  of  a  dense  cosmic-ray  burst  at  n  =  16 
which  reduces  the  quality  of  the  fit  <b '  Counting  distribution 
for  the  same  experiment  after  removal  of  the  singular  dense 
shower  at  rt  =  16  and  renormalization  of  the  data  Now  note 
the  similarity  to  Fig  7,  The  Fano  factor  decreased  from  1  54 
to  1.36 


1.44  sec"1  for  the  two  models.  These  ob¬ 
served  rates  are  an  order  of  magnitude  higher  than  that 
which  is  expected  from  an  estimated  sea-level  flux  of  1.0 
cm'2  min"1  (Ref.  271  through  the  2.5-cm  Hamamatsu 
photocathode  (i.e.,  0.1  sec”1).  The  discrepancy  may 
anse  from  the  superior  resolution  of  this  PMT  operated 
in  the  photon-counting  mode,  which  can  detect  low- 
height  Cherenkov  pulses. 

5.  Counting  probability  distributions  for  other  PMVs 

These  observations  are  complemented  by  two  addi¬ 
tional  counting  distributions  obtained  with  PMT’s  other 
than  the  Hamamatsu.  The  former  used  a  counting 
period  similar  to  that  employed  in  our  second  sequence 
of  experiments  (7"  =  40  gtsec,  duration  D  =19  min)  with 
the  Schlumberger  EMR  type  541N-06-14  PMT.  This 
PMT  is  similar  to  that  to  be  used  in  the  now  pending 
NASA/JPL  Galileo  mission  to  Jupiter.  The  distribu¬ 


FIG  9  (a)  Mean  count  (n  )  (in  units  of  10” 5)  and  (b)  Fano 
factor  F„ ITl  versus  counting  time  T  (yiscc)  for  the  sequence  of 
experiments  illustrated  in  Fig.  6,  after  removal  of  large  count 
numbers  resulting  from  intense  bnef  cosmic-ray  bursts.  The 
mean  count  was  not  significantly  affected,  but  the  occasional 
high  peaks  in  the  Fano  factor  that  are  prevalent  in  Fig  6  are 
eliminated.  Nevertheless,  the  counting  distributions  remain 
super-Poisson,  with  F.lTla  1.3,  in  the  range  of  counting  times 
displayed  The  variation  of  T  does  not  contribute  to  the  rais¬ 
ing  of  the  Fano  factor  above  unity;  rather,  this  results  from  the 
increased  experimental  duration  D. 


tion,  which  is  shown  in  Fig.  12,  resembles  that  for  the 
Hamamatsu  PMT  shown  in  Fig.  7,  except  that  larger 
clusters  are  observed. 

The  final  counting  distribution,  presented  in  Fig  13, 
represents  one  of  two  similar  experiments  carried  out 
with  an  RCA  type  8575  tube.  The  counting  time  and 
duration  were  especially  long  (T=10  msec,  D  =  3  d). 
Although  the  full  distribution  extended  to  the  count 
number  n  =137,  it  was  curtailed  at  n  =  50  and  renormal¬ 
ized  to  increase  clarity,  but  without  loss  of  essential  in¬ 
formation. 

This  experiment  is  appealing  because  it  facilitates  an 


(a)  '■ 


(b) 


(c)  •:  ■ 


FIG.  10.  (a)  Primary  cosmic-ray  driving  rate  (sec' 1 1,  lb' 
mean  multiplication  (m  )  and  (cl  dark  (noncascading i  driving 
rate  pD  (sec"'),  versus  counting  time  T  These  parameters 
were  extracted  from  the  sequence  of  experiments  shown  in  Fig 
9  via  the  PDYF«P  model  for  background  events  They  do  not 
vary  substantially  over  the  entire  range  of  counting  times 
displayed.  Only  one  experiment  (For  T  =  24  psecl  out  of  50 
was  not  able  to  be  fit  by  the  model. 
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FIG.  6.  (a)  Mean  number  of  events  In)  (in  units  of  10 "5I 
versus  counting  time  T  (/tsec)  registered  by  a  Hamamatsu  type 
R431-S  PMT  (11/13/83,  start  time  19:3000)  The  counting 
time  T  increases  for  each  successive  experiment  beginning  with 
T  =  10  jjsec  and  ending  with  T  =  59  jx sec,  while  the  duration 
D  =  1000  sec  throughout  The  time  required  to  collect  the  en¬ 
tire  set  of  50  counting  distributions  was  a  14  h  The  mean 
count  has  an  approximately  linear  dependence  on  T  with  a 
slope  of  3  sec*',  (b!  Fano  factor  F„l7" i  versus  counting  time  T 
for  the  same  set  of  experiments.  The  minimum  value  for  the 
Fano  factor  is  1  19,  whereas  in  Fig  4(bi  it  is  just  the  unity 
Poisson  barner  This  is  a  consequence  of  the  increase  in  dura¬ 
tion  to  1000  sec.  which  results  in  some  cosmic-rav  clusters  al¬ 
ways  being  present  in  each  experiment,  so  that  it  is  no  longer 
possible  to  avoid  their  effect  on  the  photomultiplier-tube  back¬ 
ground 

A  typical  counting  distribution  that  lacked  dense 
bursts  is  shown  in  Fig  7  (T  =41  jtsec;  see  Fig  6i.  Both 
models  share  the  ability  to  reproduce  the  nearly 
geometric  tail  of  the  distribution  with  excellent  precision 
over  7  orders  of  magnitude.  The  presence  of  a  dense 
burst,  on  the  other  hand,  reduces  the  quality  of  both  fits 
substantially,  as  the  counting  distribution  of  Fig  8(a>  il¬ 
lustrates  (T  =44  jjsec;  see  Fig  6), 

These  large  bursts  do  not,  however,  invalidate  the 
theoretical  approach  The  cascade  theory  describes  only 
the  general  properties  of  shower  fluctuations  and  is  not 
designed  to  accommodate  the  extremely  nonstationary 
behavior  of  the  dense  cosmic-ray  burst  at  n  =  16  in  this 
data  set.  Indeed,  it  is  of  interest  to  examine  the  data 
when  a  singular  burst  such  as  this  is  removed  from  the 
counting  distribution.  This  is  illustrated  in  the  renor¬ 
malized  distribution  shown  in  Fig.  8(b).  The  improve¬ 
ment  is  quite  dramatic  and  the  result  resembles  that 
presented  in  Fig.  7  (note  the  difference  in  scale).  The 
mean  and  Fano  factor  that  result  from  the  removal  of 
such  bursts  from  the  data  of  Fig.  6  is  shown  in  Fig.  9. 
Comparing  these  figures  shows  that  the  removal  of  the 
dense  cosmic-ray  activity  has  little  effect  on  the  mean 
count  but  significantly  reduces  the  Fano  factor.  Never¬ 
theless,  Fn(T)  retains  its  super-Poisson  character,  as  a 
result  of  the  residual  tails  of  the  counting  distributions 

4.  Extraction  of  model  parameters 

The  uniformity  of  this  revised  data  set  encouraged  us 
to  attempt  to  extract  the  stationary  properties  of  our 
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FIG  7  Counting  distribution  P(n\T)  versus  number  of 
counts  n  for  the  Hamamatsu  type  R431-S  PMT  (11/14/83, 
04  26  00i  operated  in  the  dark  with  counting  time  T  =  41  jxsec 
and  duration  D  =  1000  sec  Open  circles  represent  experimen¬ 
tal  values,  whereas  the  solid  and  dashed  curves  represent  the 
PDYF°P  and  NB®P  theoretical  models,  respectively.  All  ex¬ 
perimental  and  theoretical  parameters  are  displayed  in  Table  I. 
Both  models  provide  excellent  and  nearly  indistinguishable  fits 

three  model  parameters.  The  total  background-count 
rate  is  decomposed  into  a  cosmic-ray  primary  rate  and  a 
dark  rate  from  other  sources  at  the  detector.  The  last 
parameter  carries  the  multiplicative  strength  of  the  at¬ 
mospheric  cascade  The  theoretical  parameters  corre¬ 
sponding  to  the  two  models,  \(ip,(m  ),pD '■  and 
|,  are  given  by  Eqs.  132)  and  (35),  respec¬ 
tively.  These  were  extracted  from  each  counting  distri¬ 
bution  represented  in  Fig.  9  and  are  displayed  in  Figs. 
10(a)- 10(c)  and  11(a)- 11(c)  for  the  Poisson-driven  Yule- 
Furry  and  negative-binomial  models,  respectively. 

One  immediate  feature  of  the  results  is  the  stability  of 
the  mean  multiplication  that  is  obtained  from  both  mod¬ 
els  over  the  entire  range  of  counting  times.  From  a 
mathematical  point  of  view,  the  similarity  in  the  magni¬ 
tude  of  ( m  >  and  ( m ' )  (both  are  ~  1.2)  results  from  the 
proximity  of  the  inversion  parameter  ( T  » -f  i  to  the  re¬ 
gion  of  convergence  of  the  two  solutions  in  Fig.  3. 

Whatever  small  variation  exists  in  the  full  background 
rate  must,  therefore,  be  apportioned  to  the  primary  and 
dark  rates.  Those  for  the  PDYF  »P  are  about  the  same 
order  of  magnitude  i^p,pD  «  1.0  sec-  1 ),  whereas  the 
NBoP  model  places  a  slightly  greater  emphasis  on  the 
primary  rate  (ji^,  =  1.2  sec-1)  with  a  corresponding  de¬ 
crease  in  the  dark  rate  (gz^,  =  0. 7  sec-1).  This  compara¬ 
tive  reduction  in  fs'D  has  the  effect  of  reducing  the  fitting 
efficiency  of  the  NB®P  model,  for  its  dark-rate  estimate 
is  more  likely  to  be  driven  below  zero  on  occasion,  and 
thus  yield  an  unacceptable  solution.  This  is  evident 
from  the  gaps  in  the  histogram  records  of  Figs.  10  and 
11  (the  PDYF«P  model  successfully  fit  49  of  50  experi¬ 
ments,  while  the  NB®P  model  could  not  accommodate 
five  additional  cases). 

From  the  above  parameters,  we  find  the  total  fluctua¬ 
tion  of  cosmic-ray  particles  at  ground  level  in  this  range 
of  counting  times  to  be  about  np(m  )  a  1.2  sec"  1  and 


36 


STATISTICAL  PROPERTIES  OF  COSMIC-RAY  SHOWERS  AT 


2657 


The  resemblance  between  the  Poisson-driven  Y  ule- 
Furrs  and  the  negative-binomial  descriptions  of  cosmic- 
rav  showers  is  reflected  by  the  near  indislinguishability 
of  the  two  fits  in  Fig  5  and  in  those  to  follow.  The  con¬ 
volution  of  the  former  with  a  Poisson  distribution 
PDY'F  oPi  is  shown  bv  the  solid  curve,  whereas  the 
convolution  with  the  latter  (NBoPi  is  shown  by  the 
dashed-line  fit  We  adopt  this  notation  for  all  subse¬ 
quent  counting  distributions  Parameters  have  been  ex¬ 
tracted  from  the  data  for  each  model  in  accordance  with 
Eqs.  (32 1  and  (35);  these  are  collected  in  Table  I. 

In  a  second  photon-counting  sequence,  we  increased 
the  duration  D  of  each  experiment  to  1000  sec  and, 
furthermore,  changed  the  counting  time  T  from  10  grsec 
to  59  gisec  in  consecutive  experiments,  to  observe  the 
dependence  of  the  model  parameters  on  T  The  number 
of  samples  per  counting  distribution  naturally  decreased 
with  increasing  counting  time,  but  with  an  unsubstantial 
loss  of  accuracy;  at  T  =  59  psec  we  still  obtained  some  8 
million  samples  as  compared  with  48  million  samples  at 
T  =  10  gisec. 

We  present  the  mean  count  for  this  sequence  of  exper¬ 
iments  in  Fig  6(a)  as  a  function  of  counting  time  T.  The 
data  exhibit  reasonable  linearity  between  (n  >  and  T.  as 
expected;  the  larger  the  counting  period,  the  more  the 


number  of  events  from  the  point  process  that  enter  the 
counting  window.  The  proportionality  factor  provides  a 
measure  of  the  mean  rate  of  total  background  counts  for 
this  Flamamatsu  tube.  A  linear  fit  to  the  data  yields  a 
slope  of  about  3.0  sec"1,  which  accords  with  the  esti¬ 
mate  from  Fig.  4(a). 

The  Fano  factors  in  Fig.  6(b)  can  be  compared  to 
those  of  the  first  sequence  of  experiments  in  Fig.  4(b) 
(note  the  difference  in  the  ordinate  scales,  however).  The 
maximum  value  of  F„(T)  (6.98)  occurred  at  T  =  52  gisec 
(this  resulted  from  single  dense  bursts  of  62  and  64 
counts  each),  whereas  the  minimum  Fano  factor  was 
1.19.  It  is  evident  that,  under  these  experimental  condi¬ 
tions,  every  measurement  in  the  record  leads  to  a  super- 
Poisson  result,  so  the  effects  of  cosmic-ray  showers  on 
the  photomultiplier-tube  background  can  no  longer  be 
avoided  This  is  a  direct  consequence  of  the  increase  in 
the  duration  of  each  experiment  to  D  =  1000  sec  (=17 
min).  It  is  longer  than  the  entire  sequence  of  experi¬ 
ments  shown  in  Fig.  4(b),  thereby  unavoidably  leading  to 
the  capture  of  cosmic-ray  clusters. 

The  permanence  of  the  cosmic-ray  contribution 
throughout  this  data  set  provides  an  opportunity  for 
testing  the  applicability  of  our  models.  We  discuss  two 
distinct  examples  for  this  tube. 


TABLE  I  Experimemal  parameters  for  the  full  counting  disinbutions  presented  in  Sec  111  C 


Counting 

Duration 

Total 

Total  rate  —  1 

Fano 

Gamma 

Counting 

time 

of  run 

samples 

(n  )/T 

factor 

factor 

experiment 

Fig 

T 

D 

V. 

(sec'  ') 

F'<T  i 

r 

Hamamatsu  R431-S 

5 

10  gi  sec- 

10  sec 

480  353 

2.498 

1.500 

0.333 

12/02/83  17  15  45 
Hamamatsu  R431-S 
11/14/83  04  26  00 

7 

41  fi  sec 

1000  sec 

1 1  732  575 

2.732 

1.355 

0.310 

Hamamatsu  R431-S 

8ia 

44  gisec 

1000  sec 

10902  318 

2.952 

1.538 

0  142 

11/14/83  05  16  15 
Hamamatsu  R431-S 

8'  b  ■ 

44  gisec 

1000  sec 

10902  317 

2.918 

1.373 

0  341 

11/14/83  0516  15 
(burst  deleted 

EMR  541 N -06- 14 

09/16/77 

12 

40  gi  sec- 

19  min 

8  192  001 

58.972 

1.887 

0.146 

RCA  8575 

07/08/82  17  0146 

13 

10  msec 

a  3  d 

3  499  873 

149  460 

2  750 

0  065 

Theoretical 

parameters  extracted  from  fits 

to  the  data 

Counting 

Vp 

Hr> 

9, 

9r> 

experiment 

F'g 

(sec  ') 

( m  > 

(sec '  ') 

(sec  ') 

( m ' ) 

(sec  1 1 

Hamamatsu  R431-S 

5 

1  405 

1.333 

0  625 

2.026 

1.233 

6  241  x  10 

12/02/83  17  15  45 
Hamamatsu  R43I-S 

7 

0.844 

1  408 

1.544 

1,236 

1.282 

1  148 

11/14/83  04  26  00 
Hamamatsu  R43I-S 
11/14/83  05  16  15 

8'a1 

0  176 

2.683 

2  480 

0.314 

2  004 

2,323 

Hamamatsu  R431-S 

8(b) 

1.338 

1.310 

1.166 

1  919 

1218 

0.582 

11/14/83  05  16  15 
(burst  deleted) 

EMR  54 IN -06- 1 4 
09/16/77 

12 

6  194 

7..615 

42.775 

10.968 

1  969 

37,375 

RCA  8575 

13 

5.406 

5  444 

120.030 

11.990 

3.273 

110.220 
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ensure  a  reliable  estimate  of  the  distribution  P'n\T< 
The  total  time  required  to  collect  all  samples  is  called 
the  experimental  duration  D  In  the  case  of  measure¬ 
ments  taken  in  the  dark,  of  course,  the  PMT  is  not 
unshuttered 

We  have  also  operated  an  RCA  type  8535  PMT  under 
similar  conditions  using  a  2-kV  Kepco  type  188-00.30 
power  supply,  an  EGG  type  NA201/N  quadamplifier 
(8  >  I,  and  an  EGG  type  TR204/N  discriminator  Addi¬ 
tional  data  were  obtained  from  a  Schlumberger  EMR 
type  541V06-I4  PMT  being  used  in  a  study  of 
radiation-environment  effects  on  the  star  tracker  for 
NASA's  Galileo/Jupiter  mission  l! 

The  spectral  response  of  all  three  tubes  is  within  the 
Cherenkov  line  shape  The  Hamamatsu  device  was 
selected  for  us  high  (Cs-Te  photocathode  sensitivity  in 
the  ultraviolet  210  nm  .  whereas  the  other  two  PMT's 
employed  bialkali  materials  that  peak  in  the  blue  '400 
nm 

3  Counting  probability  distributions 

Two  main  sequences  of  experiments  were  carried  out 
using  the  Hamamatsu  tube  A  discriminator  setting  of  I 
V  was  used  to  ensure  that  the  recorded  events  originate 
in  the  predynode  environment  In  the  first  sequence,  the 
counting  time  T  and  the  duration  D  of  eacn  experiment 
were  fixed  at  10  gtsec  and  10  sec.  respectively  Given  the 
duty  cycle  of  the  statistical  counter  1x0.5’  and  a  5-sec 
processing  hiatus  between  experiments,  we  were  able  to 
generate  four  counting  distributions  every  minute  with 
about  480 (XJ0  samples  each.  Collecting  a  full  sequence 
in  this  set  required  over  an  hour,  but  we  focus  on  the 
first  12  min.  which  are  representative  of  the  entire 


record 

Some  of  the  experimental  data  is  displayed  (in  histo¬ 
gram  formi  as  a  function  of  elapsed  time  in  Fig  4  We 
depict  the  mean  count  <  n  >  of  each  counting  distribution 
P<rt,7:  in  Fig  4ia 1  and  the  associated  Fano  factor 
F„’Tl  in  Fig  4(b),  both  with  a  logarithmic  ordinate 
The  total  experimental  background  rate  is  a  2  sec  V 

The  Fano  factor  serves  as  an  effective  tool  for  discern¬ 
ing  the  underlying  clustering  phenomena.  Although  it  is 
essentially  unity  in  nearly  70%  of  these  experiments,  the 
Fano  factors  for  the  remaining  counting  distributions  are 
sufficiently  greater  than  unity  to  signal  the  presence  of 
shower  events  An  example  is  provided  about  halfway 
into  the  record,  where  there  was  an  obvious  deviation 
from  Poisson  behavior  that  iasted  for  over  a  minute 
The  shower  was  most  intense  during  the  counting  exper¬ 
iment  that  began  at  the  fifth  minute;  in  a  single  counting 
time  of  ID  gtsec  there  was  a  burst  of  32  events  and  the 
Fano  factor  reached  a  value  of  21  Our  choice  of  experi¬ 
mental  parameters  is  clearly  sufficient  to  resolve  the 
characteristic  arrivals  of  shower  particles  at  the  detector. 
This  behavior  could  not  have  been  inferred  by  analyzing 
the  evolution  of  the  mean  count  alone,  as  is  evident  from 
the  record  in  Fig  4ia). 

Some  of  the  counting  distributions  are,  however,  too 
sparse  to  provide  a  rigorous  test  of  the  models  under 
consideration. In  fact,  only  one  super-Poisson  experi¬ 
ment  in  the  12-roin  record  was  successfully  fit  by  our 
models;  it  is  shown  in  Fig.  5  and  corresponds  to  5  45 
elapsed  time  in  Fig.  4.  Although  a  triple  count  occurred 
only  once  in  that  experiment,  it  was  sufficient  to  drive 
the  Fano  factor  above  unity  and  exemplifies  the  vulnera¬ 
bility  of  Poisson  statistics  to  excess  clustering 


i  • 


(ai 


(b) 


FIG  4  < a >  Mean  number  of  events  (n  )  (in  units  of  10  'i 

versus  real  time  t  (mini  registered  by  a  Hamamatsu  type 
R4  31-S  PMT  ovei  a  12-min  period  112/2/83,  start  time 
17  lOOOi  The  counting  lime  T=  10  psec  and  the  duration 
D  =■  10  sec  for  each  experiment  Successive  experiments  take 
place  every  15  sec  The  total  background  rate  is  =2  sec  lb’ 
Fan o  factor  F„(7'>  versus  real  time  for  the  same  set  of  experi¬ 
ments  Note  that  F„(7T  =  1.0  for  about  70/7  of  the  counting 
distributions,  indicating  the  complete  absence  of  cosmic-ray 
clusters  It  stretches  to  a  maximum  of  21  for  the  remaining 
runs,  indicating  the  presence  of  bursts  The  cosmic-ray  contri¬ 
bution  to  the  photomultiplier  background  can  thus  be  general¬ 
ly  avoided  if  the  duration  of  an  experiment  is  limited  to  less 
than  tens  of  seconds 


10 


10 


FIG  5.  Counting  distribution  Fin,  T)  versus  number  of 
counts  n  for  the  Hamamatsu  type  R431-S  PMT  (12/2/83. 
17;  1 5  45 1  operated  in  the  dark  with  count  time  T  =  10  psec  and 
duration  D  =  10  sec  Open  circles  represent  experimental 
values  (note  that  P(2l  =  0l;  the  convolution  of  the  Poisson- 
driven  Yule-Furry  cosmic-ray  cascade  with  a  Poisson 
(PDYFoP,  is  shown  as  a  solid  curve,  whereas  the  negative 
binomial  cascade  convolved  with  a  Poisson  (NBo  Pi  leads  to  the 
dashed  curve  All  experimental  and  theoretical  parameters  arc- 
displayed  in  Table  I  The  single  triple  count  cluster  was 
sufficient  to  drive  the  Fano  factor  to  1.5 
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The  inversion  is  facilitated  by  defining  a 
auxiliary  parameter 

convenient 

F  =  [  E  l  n  i  —  (  n  >  ]  /[  M ,  i  n  t  —  F  ‘  n  t  ]  , 

(30' 

which  is  readily  calculated  from  the  statistics  of  P t  n  ;  Ti. 
Assuming  first  a  Poisson-driven  Yule-Furry  cascade,  we 
substitute  Eq  (23 1  into  Eq.  (29)  to  obtain 

<  n  )  =H[>T  prT  exp'/Ji  , 

( 3 1  a  > 

V  {ru  =  nDT  -'-prT  exp<  /? )[  2  exp(f?  i-  1  ]  , 

(3 1  bi 

M  -J  n  )  =  T  +  PfT  exp(/?)[6  expi  2  P  i  -  6exp(/?)-r  1  ]  . 

(31c) 

Combining  the  above  via  Eq.  (30>  provides 
equation  for  the  mean  multiplication  (m  >  = 
the  simple  roots 

a  quadratic 
exp(j9)  with 

(m >=  1  , 

(32a  i 

(  m  )  =  -,  [1-i-il/Ti]  . 

(32b  > 

Equation  132a1  represents  the  absence  of  branching  and 
is  uninteresting;  Eq  ‘ 3 2 b >  ts,  therefore,  the  unique  solu¬ 
tion  for  the  nontrivial  multiplicative  nature  of  the  cas¬ 
cade  as  seen  by  the  PMT  The  associated  driving  rates 
are  then  determined  from 

/Ir  r=  [(  3F  1-/2-  1-rn  1-2T  !]*/■„-  1  »<n  > 

(32c) 

ppT=|[(2-ri-i3r..Fr]/[2  l-2r.]|<n> 

(32d ' 

where  F„  is  the  experimental  Fano  factor.  Repeating 
the  technique  for  the  negative-binomial  cascade,  we  em¬ 
ploy  Eq.  (26)  to  yield 

{  n  )  =gr,  T  —  (  nf: )  , 

( 3  3a  i 

1'  i  n  *  =  fiDT  ■+  ( ti(,  )[  1  >  /M )  ]  , 

(33b* 

MPni  =  nDT+  <«„>[!  +f  <b„  >  /M  '][  1  +  2‘  < 

rtn )  /Aft]  . 

(33C 

Solving  for  the  negative-binomial  parameters,  we  obtain 

<n„  >  =•  [t  2T  '/•  1  —  2F  »]  F„  -  1  '<6  )  , 

(34a  t 

M  =--  [ <  2 r  i /( 1  -  2  r  1  ] ; '  E „  -  1  i(n  >  . 

( 34b  i 

Returning  to  the  parameter  of  the  logarithmic  distribu¬ 
tion  via  Eq  (25 -  (hen  yields  H  =  -  In'  2 T  i.  so  (hat 

<m>  =  [ln-2r  •)  '[l-i  l/2r>]  . 

(35a 

7  =  |n[  1  /i  2 F  < ] [ '  2 F  1  -  2T  ']■'/„ 

1  ><  n  )  , 

1 3  5b 

ii  7  -  ,  ( 1  2 T  <F„  ]  /'  1  -  2F  :  <  n  > 

( 35c 

The  resemblance  between  Eqs  ‘32  and  '351  |<,  a  direct 
consequence  of  the  similar  underlving  multiplication 
model1  that  are  discussed  in  Sec  II  A  We  have  primed 
the  latter  three  parameters  to  minimize  confusion  be¬ 
tween  the  two  models  Graphical  representations  lor  the 
mean  multiplications  <  m  )  and  <  m  '  )  are  presented  in 


Fig  j,  as  a  function  of  T,  for  comparison  The  region  of 
physical  significance  is  shaded  and  bounded  on  the  left 
by  infinite  multiplication  for  T  -  0  and  on  the  right  by 
unity  multiplication  for  7=5  This  follows  clearly  from 
the  expressions  T  =  1  /[  3  exp< P 1  -  1  )  and  r  =  fexp 
when  0  <  P  <  ac  and  0  <  6  <  rr. ,  respectively  The  loga¬ 
rithmic  factor  in  the  negative-binomial  result  modulates 
the  solution  in  just  the  right  way  to  produce  a  graphical 
trend  like  that  for  the  PDYF  in  this  domain 

2.  Experimental  apparatus 

We  now  present  the  results  of  photon-counting  experi¬ 
ments  conducted  with  several  different  photomultipliers 
in  the  dark  at  ground  level.  A  Hamamatsu  type  R43I-S 
tube  was  operated  at  ambient  temperature  on  the  13th 
floor  of  the  Seeley  W.  Mudd  building  of  Columbia  Uni¬ 
versity.  It  was  inserted  in  a  Hamamatsu  type  005(3 
photon-counting  base  that  provides  preamplification, 
discrimination,  and  pulse  shaping  An  Ortec  type  456 
power  supply  delivered  1025  V  to  the  PMT  The  output 
of  the  PMT  base  was  passed  through  a  buffer  amplifier 
(5x1  and  fed  to  a  Hewlett-Packard  type  5370A  rate- 
counter  and  a  Langley-Ford  type  1096  statistical  photon 
counter.  The  former  measures  the  detected  mean  pho¬ 
ton  count  rate,  whereas  the  latter  generates  the  probabil¬ 
ity  distribution  Pln,Tl.  The  experiment  was  controlled 
by  a  Hewlett-Packard  type  9825B  minicomputer 

Photon  counting  is  a  simple  experimental  procedure 
Each  sample  of  the  counting  distribution  is  obtained  by 
unshuttering  the  photosensitive  PMT  cathode  and 
recording  the  number  of  appropriately  discriminated 
events  that  occur  within  a  fixed  counting  time  7  By  re¬ 
peating  the  procedure,  enough  samples  are  then  taken  to 
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FIG  3  Graphical  dependence  of  the  mean  multiplication 
solutions  <  m  '  and  ( m  ’ )  on  the  experimental  inversion  param¬ 
eter  T  The  solution  represented  b>  the  solid  curve  is  obtained 
if  the  Poisson-driven  Yule-Furrv  'PDYF  cascade  is  incor¬ 
porated  into  a  model  for  photomultiplier  background  eventv 
whereas  the  solution  represented  hv  the  dashed  curve  results  if 
the  negative  binomial  'SB-  cascade  is  considered  The  region 
of  phvsual  significance.  •  T  » 0.  is  shaded  and  bounded  b\ 
units  and  infinite  mean  multiplications  rcspectivelv  The  two 
solutions  exhibit  similar  behavior  in  this  region 
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with  spacecraft  altitude  and  attitude,  but  was  indepen¬ 
dent  of  the  position  of  the  spacecraft  in  lunar  orbit;  that 
is,  the  background  signal  was  clearly  not  of  solar,  lunar, 
or  earth  origin.”  The  count  rate  varied  from  20  7  to 
28.3  sec-1  at  lunar  altitudes  of  26  and  312.5  km,  respec¬ 
tively,  as  compared  with  0  6  sec'  1  in  prelaunch  labora¬ 
tory  tests 

The  PMT  background  events  were  processed  into 
counting  probability  distributions  with  a  counting  time 
T  =0.  I  sec,  two  of  which  are  reproduced  in  Fig.  2.  The 
dashed  line,  which  represents  a  Poisson  distribution  with 
the  same  mean  as  the  experimental  data,  provides  a  good 
fit  in  both  cases.  A  small  excess  clustering  is  inferred 
from  the  raised  tail  of  the  experimental  distributions  and 
their  associated  Fano  factors,  which  are  slightly  greater 
than  unity  [1.08  and  1.13  for  Figs.  2(a)  and  2(b),  respec¬ 
tively],  Fastie  attributes  this  super-Poisson  effect  to  ad¬ 
ditive  fluorescence  photons  from  the  MgF,  window  of 
the  photomultiplier  tube  We  reserve  a  discussion  of 
fluorescence  for  a  later  section  of  this  paper  and  accept 
Fastie's  data  as  confirmation  of  the  independence  of  pri¬ 
mary  cosmic-ray  arrivals. 

C.  Cosmic-ray  counting  experiments  at  ground  level 

I  Models  for  noise  in  photomultiplier  tubes 
operated  n  the  dark 

Photomultiplier-tube  background  events  are  most 
often  assumed  to  obey  a  simple  homogeneous  Poisson 
law,  with  dark-count  index  nD,  reflecting  the  presence  of 
independently  arriving  events  generated  by  thermionic 
emission  at  the  photocathode  and  dynode  stages,  by  ra¬ 
dioactive  elements  such  as  40 K  in  the  window  material, 
and  by  a  number  of  other  effects,  some  of  which  are  dis¬ 
cussed  subsequently  24  *5  ‘  2”  The  counting  distribution 
PtnD,Ti  then  takes  the  form  of  Eq  ( I ). 

However,  the  data  at  ground  level  do  not  support  this 
simple  model.  Rather,  it  is  necessary  to  extend  this 
framework  by  incorporating  cosmic-ray  counts  from  a 
Poisson-driven  cascade  The  generalization  proceeds  in 
a  twofold  fashion  The  cascading  component  of  the 
cosmic  rays  has  been  shown  to  exhibit  super-Poisson  be¬ 
havior  via  a  counting  distribution  P<nr.T>  that  incorpo¬ 
rates  the  clustered  aspect  of  the  background  A  noncas¬ 
cading  cosmic-ray  component,  such  as  the  muon,  is  also 
registered  at  the  PMT  These  events  are  essentially  in¬ 
dependent  and  random  and  may  therefore  be  fully  ab¬ 
sorbed  in  the  Poisson  counting  distribution  Pin^.T' 
used  to  describe  the  thermionic  emission 

The  generalization  is  provided  by  the  independent  ad 
dllion  of  the  dark  index  nr,  with  the  cascade  index  n, 
The  overall  counting  distribution  Pin  ;7T  at  ground  lev¬ 
el  is  then  the  convolution  of  these  distributions 

Pin  ;T>-  Pin,  .T<oPfn„.T)  <28 > 

Their  statistics  up  to  third  order  are  purely  additive,”’ 
i  c  . 


(  n  )  -  (it,T  -  in,  )  , 
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FIG.  1  (a)  Sample  1  counting  distribution  Pin.T1  versus 

number  of  counts  n  from  a  Schlumberger  EMR  type  542G-CN 
photomultiplier  tube  in  space,  operated  with  counting  time 
7  =  0  1  sec  (data  adapted  from  Table  II  of  Ref  28  The  open 
circles  represent  experimental  values  from  5750  total  samples 
The  mean  count  (n  >  =  2. 85  and  the  Fano  factor  F, 1  T>  =  I  08 
The  dashed  curve  is  a  Poisson  distribution  with  the  same 
mean;  it  represents  an  excellent  fit  to  the  data  and  confirms  the 
independence  of  primary  cosmic-ray  arrivals  >b  Sample  2 
counting  distribution  from  ihe  same  source  In  this  case,  the 
experimental  mean  count  <n)  =  2  32  and  ihe  Fano  factor 
F. '7  i=I  13  from  5750  total  samples  Fasne  suggests  i hat  the 
excess  clusters  represented  by  ihe  raised  tail  of  the  distribution 
may  be  caused  by  luminescence  in  the  tube  window 


y  In  )=fjpT  +  Vinc  l  ,  i2db 

M  ,i  n  i  =  /x0  T  4  M  ,i  nr  ,  .  (2dc 

The  analytical  form  of  Pin,  77  will  clearly  depend  on 
the  choice  of  the  Poisson-driven  cascade  model 

Given  the  experimental  moments  obtained  from  the 
counting  distribution  Pin  ,77  of  a  PMT  operated  in  the 
dark,  it  is  of  interest  to  invert  the  relations  in  F.q  (29  to 
obtain  a  set  of  model-dependent  parameters  gi ‘the  pri¬ 
mary  driving  rate),  gip  (the  dark  driving  rate  arising 
from  noncascading  sources),  and  <m  )  ithc  mean  multi 
plication  per  primaryi  The  large  spatial  extent  of  the 
shower  and  the  small  solid  angle  of  acceptance  of  the 
detector  lead  us  to  expect  that  the  parameters  gif  and 
(m  >  will  not  reflect  the  full  effects  of  the  atmospheric 
cascade  process  Nevertheless,  sufficient  cosmic-ray 
clustering  remains  to  control  these  parameters 


v  v  v  v  v  \  v.‘*\ /,v;  .*,v>  j 
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A 1  -Anc  )=npT  exp(/3)[6expf  2/3 1  —  6  exp(/3)  +  !  ]  . 

(23c) 


These  formulas  are  appropriate  for  the  Poisson-driven 
Yule-Furry  (PDYF)  counting  distribution,  which  has 
been  shown  to  obey  the  recursion  relation1*  10 


(nc) 

P(nc  +  \;T)  = — 2,  Plnc-k'T)Alk) 
"<•  +  1  *  =o 

,  (24a) 

A  Ik)  — Ik  +1  )exp(  — 2/3)[  1  — exp(  — /3)]k  , 

(24b) 

with  the  initial  condition 

PlO,T)  =  exp<  —fipT)  . 

(24c) 

Equation  (24c)  indicates  that  the  probability  of  observing 
zero  cascade  counts  is  simply  the  probability  of  arrival 
of  zero  primaries  in  the  counting  interval.  Note  also 
that  as  /3— • 0  this  recursion  relation  reduces  to  that  for 
the  Poisson  law  of  the  primaries. 

Finally,  we  consider  the  case  of  logarithmic  multipli¬ 
cation  with  statistics  given  by  Eq.  (12).  Utilizing  Eq. 
(20)  once  more  and  using  the  transformation 

( nr,  >  =/ir  T(exp(  0 )  —  l]/0  , 

(25a) 

M=fipT/e . 

(25b) 

we  find 

( nc  )  =  <«,, )  , 

(26a) 

I'(nc)=(rty)[l— /M  1 1  , 

(26bl 

.Vfji  nr  )=  (n0  )[  1  —  i  ( n ,, )  /M  >][  1  ■+  2(  (  n0  )  /M  i] 

|  ,  (26c ) 

to  be  the  representative  statistics  of  the  Poisson-driven 
logarithmic  distribution  This  turns  out  to  be  none  oth¬ 
er  than  the  negative-binomial  counting  distribution,14 
with  mean  (n:,  )  and  degrees-of-freedom  parameter  M: 

Plnc\Ti 


^  n  c> ) 

1  r 

,+  .« 

\ 

a  I 

(27) 

as  first  shown  by  QuenGuille.^1  Interestingly  enough,  the 
negative-binomial  distribution  also  arises,  in  the  context 
of  cascade  theory,  as  the  steady-state  limit  of  the  prob¬ 
lem  of  birth,  death,  and  immigration  with  stationary 
rates.21  The  result  is  independent  of  the  number  of  pri¬ 
mary  particles  and  is  only  valid  if  the  death  rate  exceeds 
the  birth  rate;  the  limit  thus  represents  a  balance  be¬ 
tween  the  decaying  cascade  population  and  the  immigia- 
tion  rate.  The  Poisson-driven  Yule-Furry  and  the 
negative-binomial  distributions  are  both  two-parameter 
distributions. 

III.  EXPERIMENTS 

A.  The  photomultiplier  tube  as  a  detector  of  cosmic  rays 

It  has  long  been  known  that  cosmic-ray-shower  parti¬ 
cles  at  ground  level  are  registered  by  optical  detectors 


such  as  the  photomultiplier  tube  (PMT)  The  underlying 
reason  is  understood  from  the  work  of  Cherenkov  in 
1934.  A  relativistic  charged  particle  which  travels  at  a 
speed  exceeding  the  phase  velocity  of  light  in  a  transpar¬ 
ent  medium  can  induce  a  broadband  optical  shock  wave 
with  a  spectral  line  shape  predominant  in  the  visible  and 
ultraviolet  regions  of  the  electromagnetic  spectrum  22  2  ' 
Since  this  Cherenkov  emission  is  highly  directive  and 
efficient,  a  single  charged  energetic  shower  component 
that  traverses  the  thin  photomulttplier-tube  window  gen¬ 
erates  a  dense  packet  of  photons  that  is  subsequently 
amplified  into  a  large  anode  current  pulse. 

Because  the  photomultiplier  cannot  resolve  the  indivi¬ 
dual  photons  from  the  brief  Cherenkov  flash,  which  lasts 
of  the  order  of  tens  of  picoseconds  in  accordance  with 
the  cosmic  ray's  transit  time,  the  output  signal  is  just  the 
device's  impulse  response  with  an  amplitude  proportion¬ 
al  to  the  number  of  Cherenkov-induced  photoelectrons 
(if  saturation  effects  are  not  considered!.  These  large 
pulses  in  fact  account  for  the  pronounced  tails  in  pulse- 
height  distributions  of  the  photomultiplier  back¬ 
ground.24  2t 

Young27  has  shown  that  the  output  current  from  a 
PMT,  operated  in  the  continuous  mode  in  the  dark,  is 
indeed  marked  by  substantial  fluctuations  that  can  seri¬ 
ously  degrade  the  signal-to-noise  ratio  in  a  variety  of  ex¬ 
periments.  As  a  photon  counter,  however,  the  PMT 
offers  a  significant  reduction  in  the  Cherenkov  noise, 
since  each  output  pulse  that  exceeds  an  adjustable 
threshold  height  yields  a  single  count,  regardless  of  its 
excess  amplitude.  The  passage  of  a  cosmic  ray  through 
the  photon-counting  PMT  therefore  registers  one,  and 
only  one,  count,  although  these  background  events  may 
still  be  highly  detrimental  to  some  counting  and  correla¬ 
tion  experiments.  Barring  other  sources  of  PMT  events, 
a  counting  distribution  from  the  device  in  the  dark  'bus 
provides  a  direct  probe  of  the  local  cosmic-ray  activity, 
and,  in  this  capacity,  the  photon  counter  is  the  ideal  tool 
for  extracting  its  clustering  properties. 

Previous  efforts  by  Young  and  others  to  characterize 
cosmic-ray  effects  at  the  photomultiplier  tube  were  in¬ 
tended  primarily  for  dc  and  mean-square  current  appli¬ 
cations  that  do  not  take  full  advantage  of  pulse-height 
discrimination.  The  theoretical  counting  distributions 
considered  in  the  previous  section  should  provide  a  more 
appropriate  and  complete  description  of  the  cosmic-ray 
noise,  since  they  capture  the  essential  clustering  behavior 
of  these  particles  at  ground  level  and  retain  their  count¬ 
ing  statistics  to  all  orders. 

B.  Cosmic-ray  counting  experiments  in  space 

Before  examining  experimental  configurations  at 
ground  level  in  which  Poisson-driven  cascades  play  a 
role,  it  is  useful  to  experimentally  verify  that  the  pri¬ 
mary  cosmic-ray  flux  above  the  atmosphere  does  indeed 
obey  the  Poisson  law. 

Fastie2"  conducted  dark-counting  experiments  in  space 
using  a  Schlumberger  EMR  type  542G-09  solar-blind 
PMT  from  the  ultraviolet  spectrometer  aboard  the  Apol¬ 
lo  17  space-vehicle  command  module  He  concluded 
that  "  .  .  in  lunar  orbit  the  observed  dark  count  varied 
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where 

/(/)  =  exp  |  J"'[X(f')  —  v(r')]</t’ 


(15b) 


(15c) 


where  X(r)  and  v(r)  are  the  birth  and  death  rates,  respec¬ 
tively.  Setting  v(/)  =  0  and  X(/I  =  X  reproduces  the 
Yule-Furry  results  presented  in  Eqs  (8a)  and  (8bi. 

The  temporal  variation  of  these  rates  can  be  appropri¬ 
ately  chosen  to  represent  the  degradation  in  offspring 
production  due  to  the  partitioning  of  energy  among  the 
particles  of  a  developing  cascade  Arley  considered  a 
simple  example  in  which  the  death  rate  is  linearly  in¬ 
fused  [ v* / 1  =  vf ]  while  the  birth  rate  is  held  constant 
[ai|i  =  X )  The  evolution  of  the  mean  multiplication  can 
be  solved  using  Eq  U5a' 


where  the  moment-generating  function  (MGF,  is  defined 
as 


Q,(s)  =  <exp(  -sx)) 

<X 

=  2  exP*  ~sx  >f>u  • T}  •  (18, 

for  a  general  counting  process  x  (Ref.  14) 

We  can  use  the  above  result  to  obtain  a  useful  set  of 
equations  that  expresses  the  mean,  variance,  and  third 
ceritral  moment  of  the  overall  cascade  counting  distribu¬ 
tions  P(n(  ,T\  in  terms  of  those  of  the  constituent  count 
ing  processes: 

(nc)  =  ip)im),  (19a' 

V(  nr  )=  yip  K  m  )2  (p  )  F(m  )  ,  (19b> 

M ,( nf )  =  M  ^tp)i  m  )  ’  +  3  K(p)<  m  )V{m)-rip)M^m'  . 

(19c! 
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(  m  )  =  exp'  Xi,  lexp1  —  vt,J/2l  (16) 

This  is  the  same  result  as  that  for  a  birth  process  that  at¬ 
tenuates  via  the  tail  of  a  zero-mean  Gaussian  envelope 
with  the  standard  deviation  o  =  v  1  /v  Since  more  ener¬ 
getic  primaries  vield  larger  showers,  the  Gaussian  modu¬ 
lation  car,  further  be  made  dependent  on  the  total  pn- 
r-  ■  energv  hv  proper  adjustment  of  the  death-rate 

Apparent!)  the  simple  geometric  character  of  the 
>ule  Furry  pure-hirth  solution  offers  the  essential  in¬ 
gredient  f  r  the  characterization  of  cosmic-ray  showers, 
regardless  of  the  level  of  sophistication  incorporated  into 
the  cascade  model  Arlev  appreciated  this  and  suggested 
that  the  region  of  validity  of  the  simplifying  ap¬ 

proximations.  of  which  any  theory  must  necessarily 
make  use  is  always  far  wider  than  might  be  justified  by 
theoretual  arguments  " 

Mathematically,  cascade  counting  distributions  sup¬ 
port  a  multiplication  variance  term  that  is  proportional 
to  the  square  of  the  mean  This  is  evident  in  Eq  (15b) 
for  arbitrary  birth  and  death  rates.  In  the  hierarchy  of 
discrete  probability  distributions,  the  Bose-Einstein  dis¬ 
tribution  (as  well  as  its  shifted  version,  the  Yule-Furryi 
provides  the  simplest  example  cf  mean-square  excess 
fluctuations  Our  intuition  regarding  the  logarithmic 
description  of  the  multiplication  is  based  on  the  presence 
of  a  similar  dependence,  as  can  be  inferred  from  Eq. 
( 1 2b). 

B.  Showers  initiated  by  a  Poisson  number  of  primaries 

We  now  generalize  the  results  of  the  previous  section 
by  considering  a  number  of  primary  events  p,  which  is 
randomized  via  a  suitable  counting  distribution  P(p,Tt. 
Since  each  primary  event  independently  multiplies  ac¬ 
cording  to  a  model  counting  distribution  P(m;T),  the 
overall  cascade  is  an  example  of  a  compound  process 
with  well-known  properties.  In  particular,  the  moment- 
generating  function  2r(s)  of  the  overall  cascade  counting 
distribution  can  be  obtained  by  nesting  those  of  the  pri¬ 
mary  and  multiplication  counting  distributions  accord¬ 
ing  to 


These  results  are  quite  general  ana  are  explicitly  de¬ 
rived  in  Appendix  B  The  former  two  expressions  were 
first  proved  by  Shockley  and  Pierce,15  and  the  second  is 
known  as  the  cascade  variance  theorem. 16  Equation 
(19c)  is  an  extension  to  third-order  statistics;  higher- 
order  theorems  are  discussed  via  an  equivalent  method 
by  Shi h 1 7  in  his  treatment  of  hadronic  multiplicity  distri¬ 
butions. 

If  we  now  specify  that  the  primaries  arrive  in  station¬ 
ary  (or  homogeneous)  Poisson  form,  Eq.  (19)  simplifies 
considerably.  We  denote  the  mean  of  the  primary  Pois¬ 
son  distribution  (p)  as  ppT,  where  generally  p  is  the 
rate  parameter  of  the  homogeneous  Poisson  point  pro¬ 
cess  (HPP)  and  T  is  the  counting  time  over  which  we  ob- 
serve  the  events.  For  the  Poisson  primaries,  we  then 
have  ip  >  =  Vlp)  =  M-s(pl=ppT,  whereupon 


<  n,  )  =  pp  Tim  >  , 

(20a ) 

V(nc)  =  ppT[im  >2-t-K(m)]  , 

(20b> 

M^(nr)  =  ppT[im  >  ’  +  3 (m  )klml-t-Wi(mi 

]  .  (20c) 

This  can  be  compactly  rearranged  in  terms 
nary  moments  as 

of  the  ordi- 

(  nc)  =  ppTi  m  )  , 

(21a 

F(nr  )=pP  Tim2)  , 

(21b) 

)=pp  Tim 1 )  . 

(21c 

The  Fano  factor  is  given  by 

Fc(  T)—  i m  )  +F„(Tt  , 

(22) 

and  depends  solely  on  the  multiplication  statistics 
For  the  case  of  a  primary  Poisson  counting  distribu¬ 
tion  with  Yule-Furry  multiplication,  we  use  Eq  (81  ,n 
Eq.  (20)  to  provide  the  overall  cascade  statistics,  up  to 
third  order 

(  nr )  -pteT exp(/3l  , 

F(n,  )~ppT  exp(0i[  2  ex  p(/3 :  -  1  ]  . 
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Pim;T)=i\/8Hqm/m),  0<q<),m>\,  (10a) 

0=ln[l/(l-9>]  ,  (lObi 

where  0  is  a  normalization  factor.  We  can  rewrite  this 
distribution,  in  terms  of  6,  into  the  suggestive  form 

P(m  ;D  =  [1  —  exp(  —  0)]m/0m,  8>  0,  m>l  ,  ill) 

with  statistics,  up  to  third  order, 

<m  >  =  (l/0)[exp(0)-l]  ,  (12a) 

Vim  )  =  (m  )  +  (0  —  1 )( m  )2  ,  (12b) 


Mi(m)  =  (m  )  +  3(0—  1  )<  m  )2  +  (20:  —  30  +  2 )( m  )3  . 

(12c) 

The  corresponding  Fano  factor  is 

Fm ( D  =  1  +[(0  —  1  )/0][exp(0)  —  1  ]  .  (13) 

The  striking  similarity  between  the  Yule-Furry  and 
logarithmic  statistics  is  not  accidental.  As  shown  in  Ap¬ 
pendix  A,  these  counting  distributions  are  in  fact  related 
when  0=0,  although  statistical  equality  does  not  follow 
from  this  condition  [as  is  evident  from  Eqs.  (8a)  and 
(12a)].  Nevertheless,  we  may  mimic  the  evolution  of  the 
Yule-Furry  mean  multiplication  by  setting 

( m  )  =  1 1  /0)[exp(0l-  1  ]  =  exp(  y )  ,  (14) 

which  is  a  transcendental  equation  for  8—fiy).  The 
dependence  of  the  Fano  factor  on  y  cannot  be  deter¬ 
mined  analytically,  although  this  poses  no  difficulty  since 
for  a  given  value  of  y  we  may  obtain  the  corresponding 
value  of  0  (and,  hence,  the  Fano  factor)  from  a  numeri¬ 
cal  table. 

The  advantage  of  this  technique  rests  on  the 
mathematical  equivalence  of  the  parameter  y  to  a  and  0, 
which  allows  us  to  present  the  associated  Fano-factor 
dependencies  together  for  comparison  in  Fig  1(a).  In 
contrast  with  the  asymptotic  Fano  factor  of  the  shifted- 
Poisson  counting  distribution,  the  Fano  factors  for  the 
Yule-Furry  and  logarithmic  counting  distributions  fol¬ 
low  a  similar  trend  and  rise  quickly  with  increasing  mul¬ 
tiplication  The  latter  two  models  thus  allow  us  to  de¬ 
scribe  the  clustered  behavior  of  cosmic-ray  showers  for 
large  multiplication  with  far  greater  accuracy  than  the 
former  The  principal  underlying  reason  is  the  associa¬ 
tion  of  parents  to  their  offspring  at  each  branching  stage 
of  the  multiplicative  process. 

Representative  probability  distributions  are  presented 
in  Fig  Kb i  for  ( m  )  =  1.005,  1 .2,  and  3.0.  Note  that  in 
this,  and  all  counting  distributions  to  follow,  the  theoret¬ 
ical  values  are  shown  as  continuous  curves  for  clarity, 
but  they  have  significance  only  at  the  discrete  values  of 
the  count  index  It  is  clear  that  for  fixed  multiplication 
the  logarithmic  distribution  exhibits  greater  variance 
than  the  Yule-Furry  distribution;  however,  all  three 
models  converge  to  an  equivalent  description  for  very 
low  multiplication  This  is  also  apparent  in  Fig.  1(a). 

W'e  conclude  this  section  by  speculating  on  the  validi¬ 
ty  of  the  Yule-Furry  distribution  in  describing  the  soft 
component  of  the  cosmic-ray  progeny.  Clearly  the  mod- 


FIG.  1.  (a)  Multiplication  Fano  factor  FmiT)  =  Vim)/(m  ) 
versus  branching  parameter  for  three  models  of  a  cascade  that 
is  initiated  by  a  single  primary  particle.  The  Fano  factors  of 
the  Yule-Furry  (solid  curve)  and  logarithmic  (dashed  curve) 
counting  distributions  increase  rapidly  with  branching  parame¬ 
ter,  in  contrast  with  the  bounded  Fano  factor  of  the  shifted- 
Poisson  (dashed-dolled  curve)  counting  distribution.  The 
former  two  models  thus  provide  a  more  accurate  description  of 
cosmic-ray  shower  fluctuations  (b)  Representative  multiplica¬ 
tion  counting  distributions  Pim;T)  versus  number  of  multipli¬ 
cation  events  m  for  the  models  of  (a)  with  <m  >=  I  005,  1.2, 
and  3.0.  Although  all  three  models  are  equivalent  at  low  mean 
multiplication,  the  Yule-Furry  and  logarithmic  differ  substan¬ 
tially  from  the  shifted-Poisson  counting  distribution  as  <m> 
increases  The  tails  of  distributions  for  the  former  two  models 
become  much  more  developed  than  the  latter,  thus  incorporat¬ 
ing  cascades  with  a  larger  number  of  events. 


el  represents  only  a  first-order  idealization  of  the  prob¬ 
lem  since  it  considers  only  a  single  particle  species 
(hence,  not  accounting  for  the  alternating  generations  of 
electrons  and  photons  in  the  physical  cascade),  and,  as  a 
pure-birth  process,  it  does  not  incorporate  particle  losses 
and  immigration. 

Extensions  of  the  Yule-Furry  have  been  considered  by 
Arley"  and  Srinivasan12  in  their  authoritative  mono¬ 
graphs.  The  framework  for  more  sophisticated  cascade 
treatments  can  be  constructed  from  the  general  problem 
of  birth,  death,  and  immigration  with  time-dependent 
rates.1'  When  no  immigration  is  allowed,  the  mean  and 
variance  of  the  multiplication  can  be  written  more  suc¬ 
cinctly,  yet  with  no  loss  of  generality,  as 
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that  a  primary  cosmic  ray  collisionally  excites  only  a  sin¬ 
gle  shower 

We  employ  the  Poisson-driven  Yule-Furry  and 
negative-binomial  counting  probability  distributions, 
which  are  derived  in  the  context  of  cascade  theory  in  the 
next  section,  to  describe  the  fluctuations  of  cosmic-ray 
showers  at  ground  level  To  demonstrate  the  efficacy  of 
these  models,  we  present  in  Sec  III  experimental  count¬ 
ing  distributions,  which  were  generated  in  the  dark  from 
ordinary  photomultiplier  tubes,  over  a  broad  range  of 
counting  times  and  experimental  durations. 

II.  THEORY  OF  COSMIC-RAY-SHOWER  STATISTICS 

A.  Showers  initiated  by  a  single  primary  event 

The  first  statistical  treatment  of  a  single-primary-event 
cascade  as  applied  to  cosmic  rays  was  presented  by 
Bhabha  and  Heitler4  in  the  United  Kingdom  and  simul¬ 
taneously  by  Carlson  and  Oppenhcimer'  in  the  United 
States.  The  model  is  equivalent  to  a  zero-memory  pro¬ 
cess  whereby  daughter  particles  are  independently  added 
to  the  cascade.  In  a  counting  experiment,  this  model 
leads  to  a  Poisson-distributed  number  of  secondaries.6 
i.e., 

P(k \T)=(k  >*exp(  — <A  >)/*!,  k>0.  (It 

The  angular  brackets  denote  an  ensemble  average  and  T 
is  the  counting  time,  which  is  chosen  to  be  sufficiently 
long  to  capture  the  entire  cluster  of  secondary  events,  so 
their  nonstationary  production  can  be  effectively  treated 
as  instantaneous.  In  other  words,  if  ts  is  the  temporal 
spread  of  the  cluster,  then  the  requirement  T  »(,  en¬ 
sures  that  no  events  are  lost  to  the  counting  process. 
We  assume  this  to  hold  henceforth. 

Clearly  the  conservation  laws  that  link  the  partici¬ 
pants  in  the  pair-production  and  bremsstrahlung  pro¬ 
cesses  would  appear  to  suggest  that  the  addition  of  new 
individuals  to  tbc  cascade  in  an  uncorrelated  manner  is 
HMdeauate.  W^»  have  included  the  Potsson  law  in  our 
discussion  because  of  its  broad  applicability  and  Us 
unique  and  fundamental  property 

(k  )  =  V(k)  =  M,(k)  .  (2) 

Here  V  and  Af(  denote  the  variance  and  third  central 
moment  of  the  distribution,  respectively.  We  employ  the 
Fano  factor,  defined  as  the  vanance-lo-mean  ratio,  as  an 
elementary  measure  of  event  dispersion.8  The  Fano  fac¬ 
tor  Fk  ( T )  of  the  above  Poisson  counting  distribution 
provides  a  convenient  standard  for  comparison  with  oth¬ 
er  distributions,  as  it  is  simply  given  by 

Fkm=  Y(k)/(k  >=  I  .  (3) 

The  inclusion  of  the  primary  particle  in  the  shower 
statistics  can  be  easily  effected  by  a  unit  translation  of 
the  counting  distribution  of  the  secondaries 

P(m,T\  =  P(k-m  -I ;T),  (4) 

yielding,  in  the  above  case,  a  shifted-Poisson  (one- 
parameterl  distribution  for  the  shower  size  or  multiplica¬ 
tion  m.  Thus, 


Pim  ;T)=  <k  )"  ‘exp!  -  <  k  )  )/(m  -  1 )!  , 

m  >  I  ,  ( 5 1 

where  the  mean  value  of  the  shifted  distribution, 
( m  )  =  <  k  >  +  1 ,  reflects  the  deterministic  addition  of  the 
primary  particle  to  the  fluctuating  number  of  secondary 
events.  Since  the  count  variance  is  unchanged,  the  Fano 
factor  becomes 

I  7~l  =  ( (  m  )  —  1  )/<  m  )  =  1-  exp(  -  a  )  , 

a  >  0  ,  ( 6 ' 

where  we  have  set  (m  >=exp(a)  The  reason  for  this 
choice  of  notation  will  soon  be  apparent  Note  that  the 
Fano  factor  now  approaches  unity  asymptotically  with 
increasing  a  as  the  primary's  determinism  becomes 
largely  obscured  by  the  growing  fluctuations  of  secon¬ 
dary  counts 

The  assumptions  of  this  simple  model  were  challenged 
by  Furry*'  soon  after  their  publication  In  his  landmark 
paper.  Furry  considered  a  pure-b>rth  model  whereby 
each  cascade  particle  may  directly  breed  daughters  via  a 
fixed  birth  rate  k.  The  multiplication  then  follows  a 
shifted-geometric  (shifted-Bose-Einsteinl  one-parameter 


counting  distribution 

P(m  ; T)  =  exp(  -/?)[  1  - expl  -p)]"' 

P  >  0.  m  >  1  .  17 i 

The  statistics  up  to  third  order  are 

(m  )  =exp(/3)  ,  (8a 1 

y  (rn  )  =  (  m  )!  -  (  m  )  ,  (8b 1 

W)lml  =  fm  )-3(m  )!  +  2(m  )'  ;  (8c1 

the  Fano  factor  is,  therefore, 

F„(  D  =  exp(fi)-  1  (9) 


Here  the  unitless  branching  parameter  P-klt  is  the 
product  of  the  growth  rate  k  and  the  clusters  playout 
time  t,.  Hence  when  (s=0  there  is  no  branching,  since 
only  the  primary  event  is  present,  but  the  mean  multiple 
cation  subsequently  grows  exponentially  with  r,  The 
quantity  a  in  Eq.  (6)  can  now  be  viewed  as  an  equivalent 
branching-process  parametrization  of  the  shifted  Poisson 
distribution. 

The  above  counting  distribution  was  independently 
derived  earlier  by  Yule  in  connection  with  his  elegant 
mathematical  treatment  of  evolution  10  An  exact  parai 
lei  follows  if  we  identify  the  shower  population  as  the 
number  of  species  within  a  genus  and  if  wt  let  the 
branching  mechanism  represent  random  mutations  The 
shifted -Bose- Einstein  counting  distribution  is,  therefore, 
also  known  as  the  Yule-Furry  distribution,  and  the  point 
process  from  which  it  is  derived  is  termed  a  Yule  Furrv 
process. 

Finally,  we  examine  the  use  of  a  related  model  whose 
mathematical  properties  resemble  those  of  the  Yule 
Furry  distribution.  This  is  the  one-parameter  logarith 
mic  counting  distribution 
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The  fluctuations  of  cosmic-ray  particles  resulting  from  extensive  air  showers  at  ground  level  are 
well  described  by  the  two-parameter  Poisson-dnven  Yule-Furry  and  negative-binomial  counting 
probability  distributions  The  background  signal  from  a  single  photomultiplier  tube  has  been  used 
to  experimentally  verify  these  results  with  remarkable  precision,  in  spite  of  the  simplicity  of  the 
underlying  pure-birth  stochastic  process.  Counting  distributions  from  three  different 
photnmulnplier-iube  detectors  operated  in  the  dark  are  presented,  together  with  the  theoretical 
predictions  Probability  distributions  of  interevent  times  have  also  been  obtained  and  these  are 
found  to  he  consistent  with  the  observed  clustering  properties  at  the  detector  output  Our  ap¬ 
proach  is  expected  to  be  of  importance  in  quantum  optics  where  cosmic-ray-shower  particles  can 
pose  a  significant  limitation  on  the  detection  of  squeezed  light. 


i  iviRoin  rrioN 

The  rich  h i s t . > r i . .« •'  lcg.K  v  embodied  by  cosmic-ray 
research  began  in  I'M;  when  the  extraterrestrial  origin 
of  ionizing  raJutior.  in  the  atmosphere  was  inferred 

from  the  pioneering  b.,;i,.,.n  experiments  of  Victor  Hess 
By  recording  ther  track-  in  triggered  cloud  chambers. 
Rossi1  and  others  deni-  risrratcd  that  some  cosmic  rays 
at  ground  level  van  initiate  cascades  and  are  stopped  in 
thin  lead  pl.i'e  wt.  ir  -ther-  continue  to  penetrate 

through  much  ihi.v  ■  •  lasers  nt  the  same  material 
without  multiple  ii’ivr  eff-  t-  hells e  the  designations  soft 
and  hard  c orr-p.-nev -  rrspe  t-veiv  The  penetrating 
power  of  thr  i.i . •  •  a  -  •  •  ■<  •In-  first  time  the  ex¬ 
istence  of  carnet.  1  ,,  'mi.  energies  and  heralded  the 

search  for  new  nrnin  •  r  ,  j  attn  le-  thus  opening  an  im¬ 
portant  chapie-  o  -ti  !•■'  pli..-, 

The  tate  o*  pt  .■  r  ,  particles  from  space 

determines  the  .1i,a  t...  -  hrh.ivi.ir  in  lead,  as  is 

now  well  undn  !•>  jr-inarv  panicles  -consisting 

mostly  of  pr  a  i  i-  s..  ■  an,  ■  entering  Earth's  at¬ 
mosphere  and  it  ay-  •"  metasuhii  species  such  as 

rr  mesons  wt  ...  "*  >akawa  strong  force 

beyond  the  aiti-n  ,  I'  n.  •  captured,  the 

charged  pior  -  ,i--  i  -  ■  mesons  that  col¬ 
lide  infrequer  •  o  ,-:-.iitul  as  the 

penetrating  face-  ■  t . a • . ■  Muons  may 

also  disinteg' i'  time  span  into 

electrons  or  p.-s  ••  -  .  .  *-•*  their  initial 

charge  state  I'  -  •  f. •  ■  hand,  arc- 

much  more  uno  i'  •  ,  ■  t  counterparts 


and  decay  swiftly  into  photon  pairs. 

These  resulting  photons  and  the  less  abundant  muon- 
dertved  electrons  are  seeds  of  the  atmospheric  avalanche 
effect,  and,  thus,  collectively  encompass  the  multiplica¬ 
tive  nature  of  cosmic  rays  A  single  high-energy  photon 
in  matter  may  materialize  into  an  electron-positron  pair, 
whose  members  in  turn  radiate  further  photons  through 
bremsstrahlung  collisions.  A  cyclic  mechanism  is  thus 
available  to  initiate  and  sustain  an  electromagnetic  cas¬ 
cade  where  the  energy  of  the  source  particle  is  parti¬ 
tioned  among  its  offspring  until  the  individual  share  is 
low  enough  for  Compton  scattering,  ionization  losses, 
and  other  processes  to  dominate  and  dilute  the 
avalanche  If  the  source  is  sufficiently  energetic,  howev¬ 
er,  the  family  survives  down  to  ground  level  where  it  can 
be  detected  as  an  extensive  air  shower.' 

It  is  the  statistical  behavior  of  this  cascading  com¬ 
ponent  of  the  cosmic-ray  progeny  that  is  of  principal  in¬ 
terest  in  this  paper.  We  focus  on  the  fluctuations  in 
shower  size  that  arise  from  the  inherent  randomness  of 
the  multiplication  processes,  rather  than  on  the  distribu¬ 
tion  of  secondary-particle  energies.  The  mathematical 
description  is  rooted  in  stochastic  branching-process 
theory 

From  the  unfolding  history  of  a  primary  cosmic  ray  in 
the  atmosphere,  we  can  clearly  identify  at  least  three  lev¬ 
els  of  randomness.  They  represent  the  fluctuations  in 
primary  arrivals,  in  the  number  of  cascade-initiating 
sources  per  primary,  and  in  the  population  of  each  such 
cascade.  For  greatest  simplicity,  however,  we  eliminate 
the  intermediate  level  of  random  behavior  and  assume 
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